DocumentCode
1759541
Title
The Inverted Multi-Index
Author
Babenko, Artem ; Lempitsky, Victor
Author_Institution
Yandex, Moscow, Russia
Volume
37
Issue
6
fYear
2015
fDate
June 1 2015
Firstpage
1247
Lastpage
1260
Abstract
A new data structure for efficient similarity search in very large datasets of high-dimensional vectors is introduced. This structure called the inverted multi-index generalizes the inverted index idea by replacing the standard quantization within inverted indices with product quantization. For very similar retrieval complexity and pre-processing time, inverted multi-indices achieve a much denser subdivision of the search space compared to inverted indices, while retaining their memory efficiency. Our experiments with large datasets of SIFT and GIST vectors demonstrate that because of the denser subdivision, inverted multi-indices are able to return much shorter candidate lists with higher recall. Augmented with a suitable reranking procedure, multi-indices were able to significantly improve the speed of approximate nearest neighbor search on the dataset of 1 billion SIFT vectors compared to the best previously published systems, while achieving better recall and incurring only few percent of memory overhead.
Keywords
Accuracy; Computer vision; Indexes; Nearest neighbor searches; Quantization (signal); Standards; Vectors; Image retrieval; Index; Nearest neighbor search; nearest neighbor search; product quantization;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2014.2361319
Filename
6915715
Link To Document