مرکز منطقه ای اطلاع رساني علوم و فناوري - Event Recognition in Videos by Learning from Heterogeneous Web Sources

DocumentCode :

639502

Title :

Event Recognition in Videos by Learning from Heterogeneous Web Sources

Author :

Lin Chen ; Lixin Duan ; Xu, D. ; Dong Xu

Author_Institution :

Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore

fYear :

2013

fDate :

23-28 June 2013

Firstpage :

2666

Lastpage :

2673

Abstract :

In this work, we propose to leverage a large number of loosely labeled web videos (e.g., from YouTube) and web images (e.g., from Google/Bing image search) for visual event recognition in consumer videos without requiring any labeled consumer videos. We formulate this task as a new multi-domain adaptation problem with heterogeneous sources, in which the samples from different source domains can be represented by different types of features with different dimensions (e.g., the SIFT features from web images and space-time (ST) features from web videos) while the target domain samples have all types of features. To effectively cope with the heterogeneous sources where some source domains are more relevant to the target domain, we propose a new method called Multi-domain Adaptation with Heterogeneous Sources (MDA-HS) to learn an optimal target classifier, in which we simultaneously seek the optimal weights for different source domains with different types of features as well as infer the labels of unlabeled target domain data based on multiple types of features. We solve our optimization problem by using the cutting-plane algorithm based on group based multiple kernel learning. Comprehensive experiments on two datasets demonstrate the effectiveness of MDA-HS for event recognition in consumer videos.

Keywords :

Internet; image classification; image recognition; learning (artificial intelligence); optimisation; video signal processing; MDA-HS method; Web images; consumer videos; cutting-plane algorithm; group based multiple kernel learning; heterogeneous Web sources; loosely labeled Web videos; multidomain adaptation with heterogeneous sources method; optimal target classifier; optimization problem; unlabeled target domain data labels; visual event recognition; Image recognition; Kernel; Optimization; Training; Vectors; Videos; Visualization; Domain Adaptation; Event Recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on

Conference_Location :

Portland, OR

ISSN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2013.344

Filename :

6619188

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=639502