DocumentCode :
2490141
Title :
An evaluation of bags-of-words and spatio-temporal shapes for action recognition
Author :
De Campos, Teofilo ; Barnard, Mark ; Mikolajczyk, Krystian ; Kittler, Josef ; Yan, Fei ; Christmas, William ; Windridge, David
Author_Institution :
CVSSP, Univ. of Surrey, Guildford, UK
fYear :
2011
fDate :
5-7 Jan. 2011
Firstpage :
344
Lastpage :
351
Abstract :
Bags-of-visual-Words (BoW) and Spatio-Temporal Shapes (STS) are two very popular approaches for action recognition from video. The former (BoW) is an un-structured global representation of videos which is built using a large set of local features. The latter (STS) uses a single feature located on a region of interest (where the actor is) in the video. Despite the popularity of these methods, no comparison between them has been done. Also, given that BoW and STS differ intrinsically in terms of context inclusion and globality/locality of operation, an appropriate evaluation framework has to be designed carefully. This paper compares these two approaches using four different datasets with varied degree of space-time specificity of the actions and varied relevance of the contextual background. We use the same local feature extraction method and the same classifier for both approaches. Further to BoW and STS, we also evaluated novel variations of BoW constrained in time or space. We observe that the STS approach leads to better results in all datasets whose background is of little relevance to action classification.
Keywords :
motion estimation; shape recognition; BoW; STS; action recognition; bags-of-words evaluation; spatio temporal shapes; Feature extraction; Games; Histograms; Kernel; Shape; Training; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Computer Vision (WACV), 2011 IEEE Workshop on
Conference_Location :
Kona, HI
ISSN :
1550-5790
Print_ISBN :
978-1-4244-9496-5
Type :
conf
DOI :
10.1109/WACV.2011.5711524
Filename :
5711524
Link To Document :
بازگشت