DocumentCode :
3408132
Title :
YouTubeCat: Learning to categorize wild web videos
Author :
Wang, Zheshen ; Zhao, Ming ; Song, Yang ; Kumar, Sanjiv ; Li, Baoxin
Author_Institution :
Comput. Sci. & Eng., Arizona State Univ., Tempe, AZ, USA
fYear :
2010
fDate :
13-18 June 2010
Firstpage :
879
Lastpage :
886
Abstract :
Automatic categorization of videos in a Web-scale unconstrained collection such as YouTube is a challenging task. A key issue is how to build an effective training set in the presence of missing, sparse or noisy labels. We propose to achieve this by first manually creating a small labeled set and then extending it using additional sources such as related videos, searched videos, and text-based webpages. The data from such disparate sources has different properties and labeling quality, and thus fusing them in a coherent fashion is another practical challenge. We propose a fusion framework in which each data source is first combined with the manually-labeled set independently. Then, using the hierarchical taxonomy of the categories, a Conditional Random Field (CRF) based fusion strategy is designed. Based on the final fused classifier, category labels are predicted for the new videos. Extensive experiments on about 80K videos from 29 most frequent categories in YouTube show the effectiveness of the proposed method for categorizing large-scale wild Web videos.
Keywords :
Internet; Web sites; image classification; information retrieval; learning (artificial intelligence); random processes; text analysis; video signal processing; Web-scale unconstrained collection; YouTubeCat; automatic video categorization; category labels; coherent fashion; conditional random field; final fused classifier; fusion strategy; hierarchical taxonomy; labeling quality; manually-labeled set; related videos; searched videos; text-based webpages; wild Web videos; Computer science; Data analysis; Indexing; Labeling; Large-scale systems; Online services; Taxonomy; Training data; Videos; YouTube;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
ISSN :
1063-6919
Print_ISBN :
978-1-4244-6984-0
Type :
conf
DOI :
10.1109/CVPR.2010.5540125
Filename :
5540125
Link To Document :
بازگشت