• DocumentCode
    3408132
  • Title

    YouTubeCat: Learning to categorize wild web videos

  • Author

    Wang, Zheshen ; Zhao, Ming ; Song, Yang ; Kumar, Sanjiv ; Li, Baoxin

  • Author_Institution
    Comput. Sci. & Eng., Arizona State Univ., Tempe, AZ, USA
  • fYear
    2010
  • fDate
    13-18 June 2010
  • Firstpage
    879
  • Lastpage
    886
  • Abstract
    Automatic categorization of videos in a Web-scale unconstrained collection such as YouTube is a challenging task. A key issue is how to build an effective training set in the presence of missing, sparse or noisy labels. We propose to achieve this by first manually creating a small labeled set and then extending it using additional sources such as related videos, searched videos, and text-based webpages. The data from such disparate sources has different properties and labeling quality, and thus fusing them in a coherent fashion is another practical challenge. We propose a fusion framework in which each data source is first combined with the manually-labeled set independently. Then, using the hierarchical taxonomy of the categories, a Conditional Random Field (CRF) based fusion strategy is designed. Based on the final fused classifier, category labels are predicted for the new videos. Extensive experiments on about 80K videos from 29 most frequent categories in YouTube show the effectiveness of the proposed method for categorizing large-scale wild Web videos.
  • Keywords
    Internet; Web sites; image classification; information retrieval; learning (artificial intelligence); random processes; text analysis; video signal processing; Web-scale unconstrained collection; YouTubeCat; automatic video categorization; category labels; coherent fashion; conditional random field; final fused classifier; fusion strategy; hierarchical taxonomy; labeling quality; manually-labeled set; related videos; searched videos; text-based webpages; wild Web videos; Computer science; Data analysis; Indexing; Labeling; Large-scale systems; Online services; Taxonomy; Training data; Videos; YouTube;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-6984-0
  • Type

    conf

  • DOI
    10.1109/CVPR.2010.5540125
  • Filename
    5540125