• DocumentCode
    2718339
  • Title

    The inverted multi-index

  • Author

    Babenko, Artem ; Lempitsky, Victor

  • Author_Institution
    Yandex, Moscow, Russia
  • fYear
    2012
  • fDate
    16-21 June 2012
  • Firstpage
    3069
  • Lastpage
    3076
  • Abstract
    A new data structure for efficient similarity search in very large dataseis of high-dimensional vectors is introduced. This structure called the inverted multi-index generalizes the inverted index idea by replacing the standard quantization within inverted indices with product quantization. For very similar retrieval complexity and preprocessing time, inverted multi-indices achieve a much denser subdivision of the search space compared to inverted indices, while retaining their memory efficiency. Our experiments with large dataseis of SIFT and GIST vectors demonstrate that because of the denser subdivision, inverted multi-indices are able to return much shorter candidate lists with higher recall. Augmented with a suitable reranking procedure, multi-indices were able to improve the speed of approximate nearest neighbor search on the dataset of 1 billion SIFT vectors by an order of magnitude compared to the best previously published systems, while achieving better recall and incurring only few percent of memory overhead.
  • Keywords
    data structures; image retrieval; query formulation; GIST vectors; SIFT vectors; approximate nearest neighbor search; data structure; efficient similarity search; high dimensional vectors; inverted multi-index; inverted multi-indices; memory overhead; product quantization; retrieval complexity; search space; standard quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4673-1226-4
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2012.6248038
  • Filename
    6248038