DocumentCode
595555
Title
Sparse shift-invariant representation of local 2D patterns and sequence learning for human action recognition
Author
Baccouche, Moez ; Mamalet, F. ; Wolf, Christian ; Garcia, Christophe ; Baskurt, A.
Author_Institution
Orange Labs. R&D, France
fYear
2012
fDate
11-15 Nov. 2012
Firstpage
3823
Lastpage
3826
Abstract
Most existing methods for action recognition mainly rely on manually engineered features which, despite their good performances, are highly problem dependent. We propose in this paper a fully automated model, which learns to classify human actions without using any prior knowledge. A convolutional sparse autoencoder learns to extract sparse shift-invariant representations of the 2D local patterns present in each video frame. The evolution of these mid-level features is learned by a Recurrent Neural Network trained to classify each sequence. Experimental results on the KTH dataset show that the proposed approach outperforms existing models which rely on learned-features, and gives comparable results with the best related works.
Keywords
feature extraction; gesture recognition; image classification; image representation; image sequences; learning (artificial intelligence); recurrent neural nets; 2D local patterns; KTH dataset; convolutional sparse autoencoder; human action recognition; learned-features; manually engineered features; midlevel features; recurrent neural network; sequence classification; sequence learning; sparse shift-invariant local 2D pattern representation; video frame; Convolutional codes; Decoding; Feature extraction; Humans; Pattern recognition; Recurrent neural networks; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location
Tsukuba
ISSN
1051-4651
Print_ISBN
978-1-4673-2216-4
Type
conf
Filename
6460998
Link To Document