Title :
Bags of Spacetime Energies for Dynamic Scene Recognition
Author :
Feichtenhofer, Christoph ; Pinz, Axel ; Wildes, Richard P.
Author_Institution :
Inst. of Electr. Meas. & Meas. Signal Process., Tech. Univ. Graz, Graz, Austria
Abstract :
This paper presents a unified bag of visual word (BoW) framework for dynamic scene recognition. The approach builds on primitive features that uniformly capture spatial and temporal orientation structure of the imagery (e.g., video), as extracted via application of a bank of spatiotemporally oriented filters. Various feature encoding techniques are investigated to abstract the primitives to an intermediate representation that is best suited to dynamic scene representation. Further, a novel approach to adaptive pooling of the encoded features is presented that captures spatial layout of the scene even while being robust to situations where camera motion and scene dynamics are confounded. The resulting overall approach has been evaluated on two standard, publically available dynamic scene datasets. The results show that in comparison to a representative set of alternatives, the proposed approach outperforms the previous state-of-the-art in classification accuracy by 10%.
Keywords :
feature extraction; image classification; image coding; image motion analysis; image representation; object recognition; BoW framework; adaptive encoded feature pooling; bag of visual word framework; camera motion; dynamic scene recognition; dynamic scene representation; feature encoding techniques; scene dynamics; spacetime energies; spatial orientation structure; spatiotemporally oriented filters; temporal orientation structure; Dynamics; Encoding; Feature extraction; Image color analysis; Spatiotemporal phenomena; Three-dimensional displays; Visualization; feature extraction; image classification; natural scenes; video recognition;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
DOI :
10.1109/CVPR.2014.343