Tell me why! ain´t nothin´ but a mistake? describing media item differences with media fragments uri and speech synthesis

Author

Steiner, Torsten ; Troncy, Raphael

Author_Institution

Google Germany GmbH, Hamburg, Germany

fYear

2013

fDate

15-19 July 2013

Firstpage

1

Lastpage

6

Abstract

We have developed a tile-wise histogram-based media item deduplication algorithm with additional high-level semantic matching criteria that is tailored to photos and videos gathered from multiple social networks. In this paper, we investigate whether the Media Fragments URI addressing scheme together with a natural language generation framework realized through a text-to-speech system provides a feasible and practicable way to visually and audially describe the differences between media items of type photo and/or video, so that human-friendly debugging of the deduplication algorithm is made possible. A short screencast illustrating the approach is available online at http://youtu.be/DWqwEnhqTSc.

Keywords

multimedia computing; natural language processing; social networking (online); speech synthesis; high level semantic matching criteria; histogram based media item deduplication algorithm; media fragments URI addressing scheme; media item differences; natural language generation framework; photos; screencast; social networks; speech synthesis; text to speech system; videos; Clustering algorithms; Media; Natural languages; Social network services; Speech; Tiles; Videos; Deduplication; Media Fragments; Media Fragments URI; Media Items; Social Networks;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on

Conference_Location

San Jose, CA

Type

conf

DOI

10.1109/ICMEW.2013.6618364

Filename

6618364