Title :
On Temporal Order Invariance for View-Invariant Action Recognition
Author :
Haq, A. ; Gondal, Iqbal ; Murshed, Manzur
Author_Institution :
Fac. of Inf. Technol., Monash Univ., Clayton, VIC, Australia
Abstract :
View-invariant action recognition is one of the most challenging problems in computer vision. Various representations are being devised for matching actions across different viewpoints to achieve view invariance. In this paper, we explore the invariance property of temporal order of action instances during action execution and utilize it for devising a new view-invariant action recognition approach. To ensure temporal order during matching, we utilize spatiotemporal features, feature fusion and temporal order consistency constraint. We start by extracting spatiotemporal cuboid features from video sequences and applying feature fusion to encapsulate within-class similarity for the same viewpoints. For each action class, we construct a feature fusion table to facilitate feature matching across different views. An action matching score is then calculated based on global temporal order constraint and number of matching features. Finally, the action label of the class with the maximum value of the matching score is assigned to the query action. Experimentation is performed on multiple view Inria Xmas motion acquisition sequences and West Virginia University action datasets, with encouraging results, that are comparable to the existing view-invariant action recognition techniques.
Keywords :
computer vision; feature extraction; gesture recognition; image fusion; image matching; image motion analysis; image sequences; spatiotemporal phenomena; video signal processing; Inria Xmas motion acquisition sequences; West Virginia University action datasets; action matching score; computer vision; feature fusion table; feature matching; global temporal order constraint; invariance property; query action; spatiotemporal cuboid feature extraction; spatiotemporal features; temporal order consistency constraint; temporal order invariance; video sequences; view-invariant action recognition techniques; within-class similarity; Cameras; Feature extraction; Principal component analysis; Spatiotemporal phenomena; Training; Video sequences; Visualization; Action recognition; feature fusion; space-time features; view invariance;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2012.2203213