• DocumentCode
    1443005
  • Title

    View-Independent Action Recognition from Temporal Self-Similarities

  • Author

    Junejo, Imran N. ; Dexter, Emilie ; Laptev, Ivan ; Pérez, Patrick

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sharjah, Sharjah, United Arab Emirates
  • Volume
    33
  • Issue
    1
  • fYear
    2011
  • Firstpage
    172
  • Lastpage
    185
  • Abstract
    This paper addresses recognition of human actions under view changes. We explore self-similarities of action sequences over time and observe the striking stability of such measures across views. Building upon this key observation, we develop an action descriptor that captures the structure of temporal similarities and dissimilarities within an action sequence. Despite this temporal self-similarity descriptor not being strictly view-invariant, we provide intuition and experimental validation demonstrating its high stability under view changes. Self-similarity descriptors are also shown to be stable under performance variations within a class of actions when individual speed fluctuations are ignored. If required, such fluctuations between two different instances of the same action class can be explicitly recovered with dynamic time warping, as will be demonstrated, to achieve cross-view action synchronization. More central to the current work, temporal ordering of local self-similarity descriptors can simply be ignored within a bag-of-features type of approach. Sufficient action discrimination is still retained in this way to build a view-independent action recognition system. Interestingly, self-similarities computed from different image features possess similar properties and can be used in a complementary fashion. Our method is simple and requires neither structure recovery nor multiview correspondence estimation. Instead, it relies on weak geometric properties and combines them with machine learning for efficient cross-view action recognition. The method is validated on three public data sets. It has similar or superior performance compared to related methods and it performs well even in extreme conditions, such as when recognizing actions from top views while using side views only for training.
  • Keywords
    computational geometry; feature extraction; image sequences; learning (artificial intelligence); action descriptor; action discrimination; action sequence; bag of features type approach; cross view action synchronization; dynamic time warping; image feature; machine learning; multiview correspondence estimation; public data set; striking stability; temporal ordering; temporal self similarity; view independent action recognition; weak geometric property; Buildings; Cameras; Fluctuations; Hidden Markov models; Humans; Machine learning; Shape; Stability; Support vector machines; Time measurement; Human action recognition; human action synchronization; local temporal descriptors.; temporal self-similarities; view invariance; Algorithms; Artificial Intelligence; Computer Simulation; Humans; Movement; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2010.68
  • Filename
    5432213