DocumentCode
2959643
Title
Ask the locals: Multi-way local pooling for image recognition
Author
Boureau, Y-Lan ; Roux, Nicolas Le ; Bach, Francis ; Ponce, Jean ; LeCun, Yann
Author_Institution
INRIA, Sophia Antipolis, France
fYear
2011
fDate
6-13 Nov. 2011
Firstpage
2651
Lastpage
2658
Abstract
Invariant representations in object recognition systems are generally obtained by pooling feature vectors over spatially local neighborhoods. But pooling is not local in the feature vector space, so that widely dissimilar features may be pooled together if they are in nearby locations. Recent approaches rely on sophisticated encoding methods and more specialized codebooks (or dictionaries), e.g., learned on subsets of descriptors which are close in feature space, to circumvent this problem. In this work, we argue that a common trait found in much recent work in image recognition or retrieval is that it leverages locality in feature space on top of purely spatial locality. We propose to apply this idea in its simplest form to an object recognition system based on the spatial pyramid framework, to increase the performance of small dictionaries with very little added engineering. State-of-the-art results on several object recognition benchmarks show the promise of this approach.
Keywords
encoding; feature extraction; image recognition; object recognition; vectors; codebooks; dissimilar features; encoding methods; feature vector space; feature vectors; image recognition; image retrieval; invariant representations; multiway local pooling; object recognition benchmarks; object recognition systems; purely spatial locality; spatial pyramid framework; spatially local neighborhoods; Benchmark testing; Dictionaries; Encoding; Feature extraction; Image coding; Training; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision (ICCV), 2011 IEEE International Conference on
Conference_Location
Barcelona
ISSN
1550-5499
Print_ISBN
978-1-4577-1101-5
Type
conf
DOI
10.1109/ICCV.2011.6126555
Filename
6126555
Link To Document