Title :
View and scale insensitive action representation and recognition
Author :
Cao Yuanyuan ; Huang Feiyue ; Tao Linmi ; Xu Guangyou
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Abstract :
In this paper, a view- and scale-insensitive action representation, VSI-Surf, is proposed. The scale-invariant shape descriptor R-transform is used to extract a compact 1D feature from the view-insensitive posture representation "Envelop shape", which uses only two orthogonal cameras without accurate calibration. Because an action is a sequence of postures, the 1D posture feature is then extended along the time dimension to integrate temporal information. The result is an action representation that is insensitive to viewpoint and scale, called VSI-Surf. Action recognition is performed in a hierarchical framework in which body actions and gestures are recognized at different levels. Encouraging recognition results are demonstrated on the multi-view IXMAS action dataset.
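The pipeline described in the abstract (a 1D R-transform feature per posture, stacked over time into a surface) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each posture is already available as a binary silhouette array, since the abstract does not describe how the "Envelop shape" is built from the two cameras. It also assumes the commonly used definition of the R-transform, the integral over rho of the squared Radon transform, normalized by its maximum. The function names `r_transform` and `action_surface`, the one-pixel projection bins, and the choice of 180 angles are hypothetical choices made for illustration.

```python
import numpy as np


def r_transform(silhouette, n_angles=180):
    """Normalized R-transform of a binary silhouette (assumed definition).

    For each angle theta, foreground pixels are projected onto the axis
    rho = x*cos(theta) + y*sin(theta), which gives a discrete Radon
    projection; the feature is the sum of the squared projection values.
    """
    ys, xs = np.nonzero(silhouette)
    if xs.size == 0:
        # Empty frame: return an all-zero feature instead of dividing by zero.
        return np.zeros(n_angles)

    # Center on the centroid so the feature does not depend on position.
    xs = xs - xs.mean()
    ys = ys - ys.mean()

    # One-pixel-wide bins that cover every possible projection value.
    radius = np.ceil(np.hypot(xs, ys).max() + 1.0)
    edges = np.arange(-radius, radius + 1.0)

    thetas = np.linspace(0.0, np.pi, n_angles, endpoint=False)
    feature = np.empty(n_angles)
    for i, theta in enumerate(thetas):
        rho = xs * np.cos(theta) + ys * np.sin(theta)
        projection, _ = np.histogram(rho, bins=edges)
        feature[i] = np.sum(projection.astype(float) ** 2)

    # Dividing by the maximum removes the dependence on silhouette size
    # (up to discretization error).
    return feature / feature.max()


def action_surface(frames, n_angles=180):
    """Stack per-frame 1D features into a (time x angle) surface."""
    return np.stack([r_transform(f, n_angles) for f in frames])
```

Under these assumptions, a sequence of T silhouettes yields an array of shape (T, n_angles), which plays the role of the time-extended posture feature described above. The hierarchical recognition stage is not covered by this sketch.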
Keywords :
Radon transforms; cameras; feature extraction; gesture recognition; IXMAS action dataset; Radon transform; VSI-Surf; compact 1D feature extraction; envelop shape; gesture recognition; insensitive posture representation; orthogonal camera; scale insensitive action recognition; scale insensitive action representation; scale invariant shape descriptor R-transform; temporal information; time dimension; Calibration; Cameras; Computer science; Computer vision; Data mining; Face detection; Face recognition; Feature extraction; Image recognition; Shape measurement; Viewpoint insensitive; action recognition; action representation;
Conference_Title :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495361