DocumentCode :
1326530
Title :
Spatiotemporal Localization and Categorization of Human Actions in Unsegmented Image Sequences
Author :
Oikonomopoulos, Antonios ; Patras, Ioannis ; Pantic, Maja
Author_Institution :
Dept. of Comput., Imperial Coll. London, London, UK
Volume :
20
Issue :
4
fYear :
2011
fDate :
4/1/2011 12:00:00 AM
Firstpage :
1126
Lastpage :
1140
Abstract :
In this paper, we address the problem of localization and recognition of human activities in unsegmented image sequences. The main contribution of the proposed method is the use of an implicit representation of the spatiotemporal shape of the activity, which relies on the spatiotemporal localization of characteristic ensembles of feature descriptors. Evidence for the spatiotemporal localization of the activity is accumulated in a probabilistic spatiotemporal voting scheme. The local nature of the proposed voting framework allows us to deal with multiple activities taking place in the same scene, as well as with activities in the presence of clutter and occlusion. We use boosting to select characteristic ensembles per class. This leads to a set of class-specific codebooks in which each codeword is an ensemble of features. During training, we store the spatial positions of the codeword ensembles with respect to a set of reference points, as well as their temporal positions with respect to the start and end of the action instance. During testing, each activated codeword ensemble casts votes concerning the spatiotemporal position and extent of the action, using the information that was stored during training. Mean shift mode estimation in the voting space provides the most probable hypotheses concerning the localization of the subjects at each frame, as well as the extent of the activities depicted in the image sequences. We present classification and localization results for a number of publicly available datasets, and for a number of sequences in which there is a significant amount of clutter and occlusion.
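The mode-estimation step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `mean_shift_mode` and `find_modes`, the Gaussian kernel, the single scalar `bandwidth`, and the representation of each vote as an (x, y, t) point with a scalar weight are all assumptions made for illustration only.

```python
import numpy as np


def mean_shift_mode(votes, weights, start, bandwidth=1.0, tol=1e-6, max_iter=200):
    """Climb from `start` to a local mode of the weighted vote density.

    Assumption: a Gaussian kernel with one scalar bandwidth for all axes.
    """
    votes = np.asarray(votes, dtype=float)
    weights = np.asarray(weights, dtype=float)
    x = np.asarray(start, dtype=float)
    for _ in range(max_iter):
        d2 = np.sum((votes - x) ** 2, axis=1)
        k = weights * np.exp(-0.5 * d2 / bandwidth ** 2)
        total = k.sum()
        if total == 0.0:
            break  # no vote has any influence here
        new_x = (k[:, None] * votes).sum(axis=0) / total  # kernel-weighted mean
        step = np.linalg.norm(new_x - x)
        x = new_x
        if step < tol:
            break
    return x


def find_modes(votes, weights, bandwidth=1.0):
    """Run mean shift from every vote and merge end points that coincide.

    Returns a list of (mode, score) pairs sorted by decreasing score, where
    the score is the weighted kernel density evaluated at the mode.
    """
    votes = np.asarray(votes, dtype=float)
    weights = np.asarray(weights, dtype=float)
    modes = []
    for v in votes:
        m = mean_shift_mode(votes, weights, v, bandwidth)
        # Treat end points closer than one bandwidth as the same hypothesis.
        if any(np.linalg.norm(m - other) < bandwidth for other, _ in modes):
            continue
        d2 = np.sum((votes - m) ** 2, axis=1)
        score = float(np.sum(weights * np.exp(-0.5 * d2 / bandwidth ** 2)))
        modes.append((m, score))
    modes.sort(key=lambda pair: pair[1], reverse=True)
    return modes
```

Under these assumptions, each returned mode would correspond to one hypothesis about where and when an action is centered, and the score gives a rough ranking of the hypotheses. Because the search is local, well-separated groups of votes yield separate modes, which is one way to picture how several activities in the same scene can be handled.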
Keywords :
feature extraction; image sequences; spatiotemporal phenomena; clutter; codebook; codeword; feature extraction process; human action categorization; mean shift mode estimation; probabilistic spatiotemporal voting scheme; spatiotemporal localization; spatiotemporal shape; unsegmented image sequence; Clutter; Feature extraction; Humans; Image sequences; Shape; Spatiotemporal phenomena; Training; Action detection; space-time voting; Algorithms; Artificial Intelligence; Humans; Image Enhancement; Image Interpretation, Computer-Assisted; Movement; Pattern Recognition, Automated; Photography; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique; Video Recording;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2010.2076821
Filename :
5575423