• DocumentCode
    270304
  • Title

    Beyond “project and sign” for cosine estimation with binary codes

  • Author

    Balu, Radhakrishnan ; Furon, Teddy ; Jégou, Hervé

  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    68884
  • Lastpage
    6888
  • Abstract
    Many nearest neighbor search algorithms rely on encoding real vectors into binary vectors. The most common strategy projects the vectors onto random directions and takes the sign to produce so-called sketches. This paper discusses the sub-optimality of this choice, and proposes a better encoding strategy based on the quantization and reconstruction points of view. Our second contribution is a novel asymmetric estimator for the cosine similarity. Similar to previous asymmetric schemes, the query is not quantized and the similarity is computed in the compressed domain. Both our contribution leads to improve the quality of nearest neighbor search with binary codes. Its efficiency compares favorably against a recent encoding technique.
  • Keywords
    binary codes; search problems; asymmetric estimator; binary codes; binary vectors; cosine estimation; cosine similarity; encoding strategy; nearest neighbor search algorithms; project and sign; random directions; Binary codes; Databases; Encoding; Estimation; Hamming distance; Quantization (signal); Vectors; Hamming embedding; Locality sensitive hashing; approximate nearest neighbors; similarity search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854934
  • Filename
    6854934