Sparse shift-invariant representation of local 2D patterns and sequence learning for human action recognition

Author

Baccouche, Moez ; Mamalet, F. ; Wolf, Christian ; Garcia, Christophe ; Baskurt, A.

Author_Institution

Orange Labs. R&D, France

fYear

2012

fDate

11-15 Nov. 2012

Firstpage

3823

Lastpage

3826

Abstract

Most existing methods for action recognition mainly rely on manually engineered features which, despite their good performances, are highly problem dependent. We propose in this paper a fully automated model, which learns to classify human actions without using any prior knowledge. A convolutional sparse autoencoder learns to extract sparse shift-invariant representations of the 2D local patterns present in each video frame. The evolution of these mid-level features is learned by a Recurrent Neural Network trained to classify each sequence. Experimental results on the KTH dataset show that the proposed approach outperforms existing models which rely on learned-features, and gives comparable results with the best related works.

Keywords

feature extraction; gesture recognition; image classification; image representation; image sequences; learning (artificial intelligence); recurrent neural nets; 2D local patterns; KTH dataset; convolutional sparse autoencoder; human action recognition; learned-features; manually engineered features; midlevel features; recurrent neural network; sequence classification; sequence learning; sparse shift-invariant local 2D pattern representation; video frame; Convolutional codes; Decoding; Feature extraction; Humans; Pattern recognition; Recurrent neural networks; Training;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition (ICPR), 2012 21st International Conference on

Conference_Location

Tsukuba

ISSN

1051-4651

Print_ISBN

978-1-4673-2216-4

Type

conf

Filename

6460998