مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning View-Invariant Sparse Representations for Cross-View Action Recognition

DocumentCode :

3428517

Title :

Learning View-Invariant Sparse Representations for Cross-View Action Recognition

Author :

Jingjing Zheng ; Zhuolin Jiang

Author_Institution :

Univ. of Maryland, College Park, MD, USA

fYear :

2013

fDate :

1-8 Dec. 2013

Firstpage :

3176

Lastpage :

3183

Abstract :

We present an approach to jointly learn a set of view-specific dictionaries and a common dictionary for cross-view action recognition. The set of view-specific dictionaries is learned for specific views while the common dictionary is shared across different views. Our approach represents videos in each view using both the corresponding view-specific dictionary and the common dictionary. More importantly, it encourages the set of videos taken from different views of the same action to have similar sparse representations. In this way, we can align view-specific features in the sparse feature spaces spanned by the view-specific dictionary set and transfer the view-shared features in the sparse feature space spanned by the common dictionary. Meanwhile, the incoherence between the common dictionary and the view-specific dictionary set enables us to exploit the discrimination information encoded in view-specific features and view-shared features separately. In addition, the learned common dictionary not only has the capability to represent actions from unseen views, but also makes our approach effective in a semi-supervised setting where no correspondence videos exist and only a few labels exist in the target view. Extensive experiments using the multi-view IXMAS dataset demonstrate that our approach outperforms many recent approaches for cross-view action recognition.

Keywords :

dictionaries; gesture recognition; learning (artificial intelligence); common dictionary; cross-view action recognition; discrimination information; multiview IXMAS dataset; novel dictionary learning framework; semisupervised setting; sparse feature spaces; view-invariant sparse representations; view-shared features; view-specific dictionaries; view-specific features; Correlation; Dictionaries; Feature extraction; Joints; Optimization; Shape; Videos; action recognition; dictionary learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision (ICCV), 2013 IEEE International Conference on

Conference_Location :

Sydney, VIC

ISSN :

1550-5499

Type :

conf

DOI :

10.1109/ICCV.2013.394

Filename :

6751506

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3428517