• DocumentCode
    595555
  • Title

    Sparse shift-invariant representation of local 2D patterns and sequence learning for human action recognition

  • Author

    Baccouche, Moez ; Mamalet, F. ; Wolf, Christian ; Garcia, Christophe ; Baskurt, A.

  • Author_Institution
    Orange Labs. R&D, France
  • fYear
    2012
  • fDate
    11-15 Nov. 2012
  • Firstpage
    3823
  • Lastpage
    3826
  • Abstract
    Most existing methods for action recognition mainly rely on manually engineered features which, despite their good performances, are highly problem dependent. We propose in this paper a fully automated model, which learns to classify human actions without using any prior knowledge. A convolutional sparse autoencoder learns to extract sparse shift-invariant representations of the 2D local patterns present in each video frame. The evolution of these mid-level features is learned by a Recurrent Neural Network trained to classify each sequence. Experimental results on the KTH dataset show that the proposed approach outperforms existing models which rely on learned-features, and gives comparable results with the best related works.
  • Keywords
    feature extraction; gesture recognition; image classification; image representation; image sequences; learning (artificial intelligence); recurrent neural nets; 2D local patterns; KTH dataset; convolutional sparse autoencoder; human action recognition; learned-features; manually engineered features; midlevel features; recurrent neural network; sequence classification; sequence learning; sparse shift-invariant local 2D pattern representation; video frame; Convolutional codes; Decoding; Feature extraction; Humans; Pattern recognition; Recurrent neural networks; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2012 21st International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4673-2216-4
  • Type

    conf

  • Filename
    6460998