DocumentCode :
2719627
Title :
Locality-constrained and spatially regularized coding for scene categorization
Author :
Shabou, A. ; LeBorgne, H.
Author_Institution :
Vision & Content Eng. Lab., CEA, Gif-sur-Yvettes, France
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
3618
Lastpage :
3625
Abstract :
Improving coding and spatial pooling for bag-of-words based feature design have gained a lot of attention in recent works addressing object recognition and scene classification. Regarding the coding step in particular, properties such as sparsity, locality and saliency have been investigated. The main contribution of this work consists in taking into acount the local spatial context of an image into the usual coding strategies proposed in the state-of-the-art. For this purpose, given an imgae, dense local features are extracted and structured in a lattice. The latter is endowed with a neighborhood system and pairwise interactions. We propose a new objective function to encode local features, which preserves locality constraints both in the feature space and the spatial domain of the image. In addition, an appropriate efficient optimization algorithm is provided, inspired from the graph-cut framework. In conjunction with the maximum-pooling operation and the spatial pyramid matching, that reflects a global spatial layout, the proposed method improves the performances of several state-of-the-art coding schemes for scene classification on three publicly available benchmarks (UIUC 8-sport, Scene-15 and Caltech-101).
Keywords :
feature extraction; graph theory; image classification; image coding; image matching; object recognition; optimisation; Caltech-101 benchmark; Scene-15 benchmark; UIUC 8-sport benchmark; bag-of-words based feature design; dense local feature extraction; graph-cut framework; local feature encoding; locality constraint preservation; locality-constrained coding; maximum-pooling operation; neighborhood system; object recognition; objective function; optimization algorithm; pairwise interactions; scene categorization; scene classification; spatial pyramid matching; spatially regularized coding; Encoding; Feature extraction; Image coding; Indexes; Optimization; Vectors; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6248107
Filename :
6248107
Link To Document :
بازگشت