Title :
Human action recognition using associated depth and skeleton information
Author :
Tang, Nick C. ; Yen-Yu Lin ; Ju-Hsuan Hua ; Ming-Fang Weng ; Liao, Hong-Yuan Mark
Author_Institution :
Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
Abstract :
The recent advances in imaging devices have opened the opportunity of better solving computer vision tasks. The next-generation cameras, such as the depth or binocular cameras, capture diverse information, and complement the conventional 2D RGB cameras. Thus, investigating the yielded multi-modal images generally facilitates the accomplishment of related applications. However, the limitations of these devices, such as short effective distances, expensive costs, or long response time, degrade their applicability in practical use. Addressing this problem in this work, we aim at action recognition in RGB videos with the aid of Kinect. We improve recognition accuracy by leveraging information derived from an offline collected database, in which not only the RGB but also the depth and skeleton images of actions are available. Our approach adapts the inter-database variations, and enables the sharing of visual knowledge across different image modalities. Each action instance for recognition in RGB representation is then augmented with the borrowed depth and skeleton features.
Keywords :
image colour analysis; image recognition; 2D RGB camera; Kinect; RGB representation; RGB video recognition; associated depth information; computer vision; human action recognition; interdatabase variation; multimodal imaging device; next-generation binocular camera; next-generation depth camera; offline collected database; skeleton information; Cameras; Computer vision; Databases; Kernel; Pattern recognition; Skeleton; Videos; Action recognition; Depth Association; Skeleton Association;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854475