DocumentCode :
248223
Title :
Advances on action recognition in videos using an interest point detector based on multiband spatio-temporal energies
Author :
Maninis, Kevis ; Koutras, Petros ; Maragos, Petros
Author_Institution :
Sch. of E.C.E., Nat. Tech. Univ. of Athens, Athens, Greece
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
1490
Lastpage :
1494
Abstract :
This paper proposes a new visual framework for action recognition in videos that consists of an energy detector coupled with a carefully designed multiband energy-based filterbank. The tracking of video energy is performed using perceptually inspired 3D Gabor filters combined with ideas from Dominant Energy Analysis. Within this framework, we utilize different alternatives such as non-linear energy operators, where actions are implicitly considered as manifestations of spatio-temporal oscillations in the dynamic visual stream. Texture and motion decomposition of actions through multiband filtering is the basis of our approach. This new energy-based saliency measure of action videos leads to the extraction of local spatio-temporal interest points that give promising results for the task of action recognition. Such interest points are processed further in order to formulate a robust representation of an action in a video. The theoretical formulation is supported by evaluation on two popular action databases, in which our method seems to outperform the state of the art.
Keywords :
Gabor filters; channel bank filters; feature extraction; image filtering; image motion analysis; image texture; object detection; object recognition; object tracking; video signal processing; visual databases; 3D Gabor filters; action databases; action motion decomposition; action recognition; action texture decomposition; action videos; dominant energy analysis; dynamic visual stream; energy detector; energy-based saliency measure; interest point detector; local spatio-temporal interest point extraction; multiband energy based filterbank; multiband filtering; multiband spatio-temporal energies; nonlinear energy operators; spatio-temporal oscillations; video energy tracking; visual framework; Accuracy; Databases; Detectors; Frequency modulation; Three-dimensional displays; Videos; Visualization; Human action recognition; dominant energy analysis; energy tracking in videos; multiband Gabor filtering; spatio-temporal interest point detectors;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location :
Paris
Type :
conf
DOI :
10.1109/ICIP.2014.7025298
Filename :
7025298