YouTubeCat: Learning to categorize wild web videos

Author

Wang, Zheshen ; Zhao, Ming ; Song, Yang ; Kumar, Sanjiv ; Li, Baoxin

Author_Institution

Comput. Sci. & Eng., Arizona State Univ., Tempe, AZ, USA

fYear

2010

fDate

13-18 June 2010

Firstpage

879

Lastpage

886

Abstract

Automatic categorization of videos in a Web-scale unconstrained collection such as YouTube is a challenging task. A key issue is how to build an effective training set in the presence of missing, sparse or noisy labels. We propose to achieve this by first manually creating a small labeled set and then extending it using additional sources such as related videos, searched videos, and text-based webpages. The data from such disparate sources has different properties and labeling quality, and thus fusing them in a coherent fashion is another practical challenge. We propose a fusion framework in which each data source is first combined with the manually-labeled set independently. Then, using the hierarchical taxonomy of the categories, a Conditional Random Field (CRF) based fusion strategy is designed. Based on the final fused classifier, category labels are predicted for the new videos. Extensive experiments on about 80K videos from 29 most frequent categories in YouTube show the effectiveness of the proposed method for categorizing large-scale wild Web videos.

Keywords

Internet; Web sites; image classification; information retrieval; learning (artificial intelligence); random processes; text analysis; video signal processing; Web-scale unconstrained collection; YouTubeCat; automatic video categorization; category labels; coherent fashion; conditional random field; final fused classifier; fusion strategy; hierarchical taxonomy; labeling quality; manually-labeled set; related videos; searched videos; text-based webpages; wild Web videos; Computer science; Data analysis; Indexing; Labeling; Large-scale systems; Online services; Taxonomy; Training data; Videos; YouTube;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on

Conference_Location

San Francisco, CA

ISSN

1063-6919

Print_ISBN

978-1-4244-6984-0

Type

conf

DOI

10.1109/CVPR.2010.5540125

Filename

5540125