• DocumentCode
    639041
  • Title

    Tell me why! Ain't nothin' but a mistake? Describing media item differences with Media Fragments URI and speech synthesis

  • Author

    Steiner, Thomas ; Troncy, Raphael

  • Author_Institution
    Google Germany GmbH, Hamburg, Germany
  • fYear
    2013
  • fDate
    15-19 July 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    We have developed a tile-wise histogram-based media item deduplication algorithm with additional high-level semantic matching criteria that is tailored to photos and videos gathered from multiple social networks. In this paper, we investigate whether the Media Fragments URI addressing scheme together with a natural language generation framework realized through a text-to-speech system provides a feasible and practicable way to visually and audially describe the differences between media items of type photo and/or video, so that human-friendly debugging of the deduplication algorithm is made possible. A short screencast illustrating the approach is available online at http://youtu.be/DWqwEnhqTSc.
  • Keywords
    multimedia computing; natural language processing; social networking (online); speech synthesis; high level semantic matching criteria; histogram based media item deduplication algorithm; media fragments URI addressing scheme; media item differences; natural language generation framework; photos; screencast; social networks; text to speech system; videos; Clustering algorithms; Media; Natural languages; Social network services; Speech; Tiles; Videos; Deduplication; Media Fragments; Media Fragments URI; Media Items; Social Networks
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Type

    conf

  • DOI
    10.1109/ICMEW.2013.6618364
  • Filename
    6618364