Title :
Tell me why! Ain't nothin' but a mistake? Describing media item differences with Media Fragments URI and speech synthesis
Author :
Steiner, Torsten ; Troncy, Raphael
Author_Institution :
Google Germany GmbH, Hamburg, Germany
Abstract :
We have developed a tile-wise histogram-based media item deduplication algorithm with additional high-level semantic matching criteria that is tailored to photos and videos gathered from multiple social networks. In this paper, we investigate whether the Media Fragments URI addressing scheme together with a natural language generation framework realized through a text-to-speech system provides a feasible and practicable way to visually and audially describe the differences between media items of type photo and/or video, so that human-friendly debugging of the deduplication algorithm is made possible. A short screencast illustrating the approach is available online at http://youtu.be/DWqwEnhqTSc.
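As a minimal sketch of how tile-wise differences could be addressed with Media Fragments URIs, the snippet below splits a media item into a grid of tiles and builds one spatial fragment URI (`#xywh=x,y,w,h`, in pixels) per tile, then turns a pair of such URIs into a sentence that could be handed to a text-to-speech system. The function names, the grid layout, and the sentence wording are assumptions made for illustration; they are not taken from the paper.

```python
def tile_fragment_uris(media_url, width, height, rows, cols):
    """Return one spatial Media Fragments URI per tile of a rows x cols grid.

    Hypothetical helper: the tiling scheme is an assumption, not the
    authors' implementation. Tiles are listed row by row, left to right.
    """
    tile_w = width // cols   # tile width in pixels (remainder pixels are ignored)
    tile_h = height // rows  # tile height in pixels
    uris = []
    for row in range(rows):
        for col in range(cols):
            x = col * tile_w
            y = row * tile_h
            # Spatial dimension of the Media Fragments URI syntax: xywh=x,y,w,h
            uris.append(f"{media_url}#xywh={x},{y},{tile_w},{tile_h}")
    return uris


def describe_tile_difference(uri_a, uri_b):
    """Build a plain-English sentence suitable for a text-to-speech system.

    Hypothetical wording, for illustration only.
    """
    return f"The region {uri_a} does not match the region {uri_b}."


if __name__ == "__main__":
    tiles_a = tile_fragment_uris("http://example.org/a.jpg", 100, 50, 2, 2)
    tiles_b = tile_fragment_uris("http://example.org/b.jpg", 100, 50, 2, 2)
    print(describe_tile_difference(tiles_a[3], tiles_b[3]))
```

For video, the same idea would combine the spatial dimension with the temporal one (`#t=start,end`, in seconds) to point at a region within a time range.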
Keywords :
multimedia computing; natural language processing; social networking (online); speech synthesis; high level semantic matching criteria; histogram based media item deduplication algorithm; media fragments URI addressing scheme; media item differences; natural language generation framework; photos; screencast; social networks; text to speech system; videos; Clustering algorithms; Media; Natural languages; Social network services; Speech; Tiles; Videos; Deduplication; Media Fragments; Media Fragments URI; Media Items; Social Networks
Conference_Title :
Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
DOI :
10.1109/ICMEW.2013.6618364