DocumentCode
76063
Title
Body Surface Context: A New Robust Feature for Action Recognition From Depth Videos
Author
Yan Song ; Jinhui Tang ; Fan Liu ; Shuicheng Yan
Author_Institution
Sch. of Comput. Sci. & Technol., Nanjing Univ. of Sci. & Technol., Nanjing, China
Volume
24
Issue
6
fYear
2014
fDate
Jun-14
Firstpage
952
Lastpage
964
Abstract
Human action recognition in videos is useful for many applications. However, there still exist huge challenges in real applications due to the variations in the appearance, lighting condition and viewing angle, of the subjects. In this consideration, depth data have advantages over red, green, blue (RGB) data because of their spatial information about the distance between object and viewpoint. Unlike existing works, we utilize the 3-D point cloud, which contains points in the 3-D real-world coordinate system to represent the external surface of human body. Specifically, we propose a new robust feature, the body surface context (BSC), by describing the distribution of relative locations of the neighbors for a reference point in the point cloud in a compact and descriptive way. The BSC encodes the cylindrical angular of the difference vector based on the characteristics of human body, which increases the descriptiveness and discriminability of the feature. As the BSC is an approximate object-centered feature, it is robust to transformations including translations and rotations, which are very common in real applications. Furthermore, we propose three schemes to represent human actions based on the new feature, including the skeleton-based scheme, the random-reference-point scheme, and the spatial-temporal scheme. In addition, to evaluate the proposed feature, we construct a human action dataset by a depth camera. Experiments on three datasets demonstrate that the proposed feature outperforms RGB-based features and other existing depth-based features, which validates that the BSC feature is promising in the field of human action recognition.
Keywords
image motion analysis; image representation; object recognition; video signal processing; 3D point cloud; 3D real-world coordinate system; BSC; body surface context; depth camera; depth data; depth videos; difference vector; human action dataset; human action recognition; random reference point scheme; skeleton based scheme; spatial temporal scheme; Context; Feature extraction; Joints; Shape; Three-dimensional displays; Vectors; Depth video; Human action recognition; depth video; feature; human action recognition; point cloud;
fLanguage
English
Journal_Title
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher
ieee
ISSN
1051-8215
Type
jour
DOI
10.1109/TCSVT.2014.2302558
Filename
6722961
Link To Document