Title :
View-Independent Action Recognition from Temporal Self-Similarities
Author :
Junejo, Imran N. ; Dexter, Emilie ; Laptev, Ivan ; Pérez, Patrick
Author_Institution :
Dept. of Comput. Sci., Univ. of Sharjah, Sharjah, United Arab Emirates
Abstract :
This paper addresses recognition of human actions under view changes. We explore self-similarities of action sequences over time and observe the striking stability of such measures across views. Building upon this key observation, we develop an action descriptor that captures the structure of temporal similarities and dissimilarities within an action sequence. Despite this temporal self-similarity descriptor not being strictly view-invariant, we provide intuition and experimental validation demonstrating its high stability under view changes. Self-similarity descriptors are also shown to be stable under performance variations within a class of actions when individual speed fluctuations are ignored. If required, such fluctuations between two different instances of the same action class can be explicitly recovered with dynamic time warping, as will be demonstrated, to achieve cross-view action synchronization. More central to the current work, temporal ordering of local self-similarity descriptors can simply be ignored within a bag-of-features type of approach. Sufficient action discrimination is still retained in this way to build a view-independent action recognition system. Interestingly, self-similarities computed from different image features possess similar properties and can be used in a complementary fashion. Our method is simple and requires neither structure recovery nor multiview correspondence estimation. Instead, it relies on weak geometric properties and combines them with machine learning for efficient cross-view action recognition. The method is validated on three public data sets. It has similar or superior performance compared to related methods and it performs well even in extreme conditions, such as when recognizing actions from top views while using side views only for training.
Keywords :
computational geometry; feature extraction; image sequences; learning (artificial intelligence); action descriptor; action discrimination; action sequence; bag of features type approach; cross view action synchronization; dynamic time warping; image feature; machine learning; multiview correspondence estimation; public data set; striking stability; temporal ordering; temporal self similarity; view independent action recognition; weak geometric property; Buildings; Cameras; Fluctuations; Hidden Markov models; Humans; Machine learning; Shape; Stability; Support vector machines; Time measurement; Human action recognition; human action synchronization; local temporal descriptors.; temporal self-similarities; view invariance; Algorithms; Artificial Intelligence; Computer Simulation; Humans; Movement; Pattern Recognition, Automated;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2010.68