DocumentCode
254105
Title
3D Pose from Motion for Cross-View Action Recognition via Non-linear Circulant Temporal Encoding
Author
Gupta, Arpan ; Martinez, Jose Luis ; Little, James J. ; Woodham, Robert J.
Author_Institution
Univ. of British Columbia, Vancouver, BC, Canada
fYear
2014
fDate
23-28 June 2014
Firstpage
2601
Lastpage
2608
Abstract
We describe a new approach to transfer knowledge across views for action recognition by using examples from a large collection of unlabelled mocap data. We achieve this by directly matching purely motion based features from videos to mocap. Our approach recovers 3D pose sequences without performing any body part tracking. We use these matches to generate multiple motion projections and thus add view invariance to our action recognition model. We also introduce a closed form solution for approximate non-linear Circulant Temporal Encoding (nCTE), which allows us to efficiently perform the matches in the frequency domain. We test our approach on the challenging unsupervised modality of the IXMAS dataset, and use publicly available motion capture data for matching. Without any additional annotation effort, we are able to significantly outperform the current state of the art.
Keywords
encoding; image matching; image motion analysis; image sequences; pose estimation; video signal processing; 3D pose sequence; IXMAS dataset unsupervised modality; closed form solution; cross view action recognition; multiple motion projection; nonlinear circulant temporal encoding; transfer knowledge; unlabelled mocap data; Databases; Encoding; Kernel; Three-dimensional displays; Training; Trajectory; Videos;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location
Columbus, OH
Type
conf
DOI
10.1109/CVPR.2014.333
Filename
6909729
Link To Document