Title :
TubeTagger - YouTube-based Concept Detection
Author :
Ulges, Adrian ; Koch, Markus ; Borth, Damian ; Breuel, Thomas M.
Author_Institution :
IUPR Res. Group, German Res. Center for Artificial Intell. (DFKI) GmbH, Kaiserslautern, Germany
Abstract :
We present TubeTagger, a concept-based video retrieval system that exploits Web video as an information source. The system performs a visual learning on YouTube clips (i. e., it trains detectors for semantic concepts like "soccer" or "windmill"), and a semantic learning on the associated tags (i.e., relations between concepts like "swimming" and "water" are discovered). This way, a text-based video search free of manual indexing is realized. We present a quantitative study on Web-based concept detection comparing several features and statistical models on a large-scale dataset of YouTube content. Beyond this, we report several key findings related to concept learning from YouTube and its generalization to different domains, and illustrate certain characteristics of YouTube-learned concepts, like focus of interest and redundancy. To get a hands-on impression of Web-based concept detection, we invite researchers and practitioners to test our Web demo.
Keywords :
Internet; content management; content-based retrieval; data mining; query formulation; search engines; social networking (online); TubeTagger; Web based concept detection; Web video; YouTube based concept detection; YouTube clips; YouTube content large scale dataset; concept based video retrieval system; semantic learning; text based video search; visual learning; Data mining; Detectors; Focusing; Image databases; Indexing; Information retrieval; Large-scale systems; Training data; Vocabulary; YouTube;
Conference_Titel :
Data Mining Workshops, 2009. ICDMW '09. IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5384-9
Electronic_ISBN :
978-0-7695-3902-7
DOI :
10.1109/ICDMW.2009.41