DocumentCode
254111
Title
Depth and Skeleton Associated Action Recognition without Online Accessible RGB-D Cameras
Author
Yen-Yu Lin ; Ju-Hsuan Hua ; Tang, Nick C. ; Min-Hung Chen ; Liao, Hong-Yuan Mark
Author_Institution
Acad. Sinica, Taipei, Taiwan
fYear
2014
fDate
23-28 June 2014
Firstpage
2617
Lastpage
2624
Abstract
The recent advances in RGB-D cameras have allowed us to better solve increasingly complex computer vision tasks. However, modern RGB-D cameras are still restricted by the short effective distances. The limitation may make RGB-D cameras not online accessible in practice, and degrade their applicability. We propose an alternative scenario to address this problem, and illustrate it with the application to action recognition. We use Kinect to offline collect an auxiliary, multi-modal database, in which not only the RGB videos but also the depth maps and skeleton structures of actions of interest are available. Our approach aims to enhance action recognition in RGB videos by leveraging the extra database. Specifically, it optimizes a feature transformation, by which the actions to be recognized can be concisely reconstructed by entries in the auxiliary database. In this way, the inter-database variations are adapted. More importantly, each action can be augmented with additional depth and skeleton images retrieved from the auxiliary database. The proposed approach has been evaluated on three benchmarks of action recognition. The promising results manifest that the augmented depth and skeleton features can lead to remarkable boost in recognition accuracy.
Keywords
computer vision; image recognition; Kinect; RGB videos; augmented depth; auxiliary database; complex computer vision tasks; depth associated action recognition; depth images; depth maps; feature transformation; inter-database variations; multimodal database; skeleton associated action recognition; skeleton features; skeleton images; skeleton structures; Cameras; Computer vision; Databases; Kernel; Optimization; Skeleton; Videos;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location
Columbus, OH
Type
conf
DOI
10.1109/CVPR.2014.335
Filename
6909731
Link To Document