DocumentCode
639041
Title
Tell me why! ain´t nothin´ but a mistake? describing media item differences with media fragments uri and speech synthesis
Author
Steiner, Torsten ; Troncy, Raphael
Author_Institution
Google Germany GmbH, Hamburg, Germany
fYear
2013
fDate
15-19 July 2013
Firstpage
1
Lastpage
6
Abstract
We have developed a tile-wise histogram-based media item deduplication algorithm with additional high-level semantic matching criteria that is tailored to photos and videos gathered from multiple social networks. In this paper, we investigate whether the Media Fragments URI addressing scheme together with a natural language generation framework realized through a text-to-speech system provides a feasible and practicable way to visually and audially describe the differences between media items of type photo and/or video, so that human-friendly debugging of the deduplication algorithm is made possible. A short screencast illustrating the approach is available online at http://youtu.be/DWqwEnhqTSc.
Keywords
multimedia computing; natural language processing; social networking (online); speech synthesis; high level semantic matching criteria; histogram based media item deduplication algorithm; media fragments URI addressing scheme; media item differences; natural language generation framework; photos; screencast; social networks; speech synthesis; text to speech system; videos; Clustering algorithms; Media; Natural languages; Social network services; Speech; Tiles; Videos; Deduplication; Media Fragments; Media Fragments URI; Media Items; Social Networks;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
Conference_Location
San Jose, CA
Type
conf
DOI
10.1109/ICMEW.2013.6618364
Filename
6618364
Link To Document