Title :
Event Recognition in Videos by Learning from Heterogeneous Web Sources
Author :
Lin Chen ; Lixin Duan ; Xu, D. ; Dong Xu
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
In this work, we propose to leverage a large number of loosely labeled web videos (e.g., from YouTube) and web images (e.g., from Google/Bing image search) for visual event recognition in consumer videos without requiring any labeled consumer videos. We formulate this task as a new multi-domain adaptation problem with heterogeneous sources, in which the samples from different source domains can be represented by different types of features with different dimensions (e.g., the SIFT features from web images and space-time (ST) features from web videos) while the target domain samples have all types of features. To effectively cope with the heterogeneous sources where some source domains are more relevant to the target domain, we propose a new method called Multi-domain Adaptation with Heterogeneous Sources (MDA-HS) to learn an optimal target classifier, in which we simultaneously seek the optimal weights for different source domains with different types of features as well as infer the labels of unlabeled target domain data based on multiple types of features. We solve our optimization problem by using the cutting-plane algorithm based on group based multiple kernel learning. Comprehensive experiments on two datasets demonstrate the effectiveness of MDA-HS for event recognition in consumer videos.
Keywords :
Internet; image classification; image recognition; learning (artificial intelligence); optimisation; video signal processing; MDA-HS method; Web images; consumer videos; cutting-plane algorithm; group based multiple kernel learning; heterogeneous Web sources; loosely labeled Web videos; multidomain adaptation with heterogeneous sources method; optimal target classifier; optimization problem; unlabeled target domain data labels; visual event recognition; Image recognition; Kernel; Optimization; Training; Vectors; Videos; Visualization; Domain Adaptation; Event Recognition;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location :
Portland, OR
DOI :
10.1109/CVPR.2013.344