Title :
Aggregating Local Image Descriptors into Compact Codes
Author :
Jegou, Herve ; Perronnin, Florent ; Douze, Matthijs ; Sanchez, Javier ; Perez, Pablo ; Schmid, Cordelia
Author_Institution :
INRIA, Rennes, France
Abstract :
This paper addresses the problem of large-scale image search. Three constraints have to be taken into account: search accuracy, efficiency, and memory usage. We first present and evaluate different ways of aggregating local image descriptors into a vector and show that the Fisher kernel achieves better performance than the reference bag-of-visual words approach for any given vector dimension. We then jointly optimize dimensionality reduction and indexing in order to obtain a precise vector comparison as well as a compact representation. The evaluation shows that the image representation can be reduced to a few dozen bytes while preserving high accuracy. Searching a 100 million image data set takes about 250 ms on one processor core.
Keywords :
image representation; image retrieval; indexing; vectors; Fisher kernel; compact codes; compact representation; dimensionality reduction optimization; image representation; indexing optimization; large-scale image search; local image descriptor aggregation; memory usage; reference bag-of-visual words approach; search accuracy; search efficiency; time 250 ms; vector comparison; vector dimension; Accuracy; Image representation; Indexing; Kernel; Vectors; Visualization; Image search; image retrieval; indexing.;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2011.235