DocumentCode :
2291041
Title :
Fast realistic multi-action recognition using mined dense spatio-temporal features
Author :
Gilbert, Andrew ; Illingworth, John ; Bowden, Richard
Author_Institution :
CVSSP, Univ. of Surrey, Guildford, UK
fYear :
2009
fDate :
Sept. 29 2009-Oct. 2 2009
Firstpage :
925
Lastpage :
931
Abstract :
Within the field of action recognition, features and descriptors are often engineered to be sparse and invariant to transformation. While sparsity makes the problem tractable, it is not necessarily optimal in terms of class separability and classification. This paper proposes a novel approach that uses very dense corner features that are spatially and temporally grouped in a hierarchical process to produce an overcomplete compound feature set. Frequently reoccurring patterns of features are then found through data mining, designed for use with large data sets. The novel use of the hierarchical classifier allows real time operation while the approach is demonstrated to handle camera motion, scale, human appearance variations, occlusions and background clutter. The performance of classification, outperforms other state-of-the-art action recognition algorithms on the three datasets; KTH, multi-KTH, and Hollywood. Multiple action localisation is performed, though no groundtruth localisation data is required, using only weak supervision of class labels for each training sequence. The Hollywood dataset contain complex realistic actions from movies, the approach outperforms the published accuracy on this dataset and also achieves real time performance.
Keywords :
data mining; image recognition; very large databases; Hollywood dataset; background clutter; camera motion; class classification; class separability; data mining; fast realistic multi-action recognition; human appearance variations; large data sets; mined dense spatio-temporal features; multi-KTH dataset; very dense corner features; Cameras; Data mining; Detectors; Fires; Humans; Image motion analysis; Motion pictures; Object recognition; Optical films; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location :
Kyoto
ISSN :
1550-5499
Print_ISBN :
978-1-4244-4420-5
Electronic_ISBN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2009.5459335
Filename :
5459335
Link To Document :
بازگشت