• DocumentCode
    2507807
  • Title

    Beyond “Near Duplicates”: Learning Hash Codes for Efficient Similar-Image Retrieval

  • Author

    Baluja, Shumeet ; Covell, Michele

  • Author_Institution
    Google Res., Google Inc., Mountain View, CA, USA
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    543
  • Lastpage
    547
  • Abstract
    Finding similar images in a large database is an important, but often computationally expensive, task. In this paper, we present a two-tier similar-image retrieval system with the efficiency characteristics found in simpler systems designed to recognize near-duplicates. We compare the efficiency of lookups based on random projections and learned hashes to 100-times-more-frequent exemplar sampling. Both approaches significantly improve on the results from exemplar sampling, despite having significantly lower computational costs. Learned-hash keys provide the best result, in terms of both recall and efficiency.
  • Keywords
    file organisation; image retrieval; learning hash codes; near-duplicates; similar-image retrieval; Computational efficiency; Distributed databases; Entropy; Probes; Training; Training data; LSH; forgiving hashing; image retrieval; learned distances; two-tier retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.138
  • Filename
    5597439