DocumentCode
270304
Title
Beyond “project and sign” for cosine estimation with binary codes
Author
Balu, Radhakrishnan ; Furon, Teddy ; JeÌgou, HerveÌ
fYear
2014
fDate
4-9 May 2014
Firstpage
68884
Lastpage
6888
Abstract
Many nearest neighbor search algorithms rely on encoding real vectors into binary vectors. The most common strategy projects the vectors onto random directions and takes the sign to produce so-called sketches. This paper discusses the sub-optimality of this choice, and proposes a better encoding strategy based on the quantization and reconstruction points of view. Our second contribution is a novel asymmetric estimator for the cosine similarity. Similar to previous asymmetric schemes, the query is not quantized and the similarity is computed in the compressed domain. Both our contribution leads to improve the quality of nearest neighbor search with binary codes. Its efficiency compares favorably against a recent encoding technique.
Keywords
binary codes; search problems; asymmetric estimator; binary codes; binary vectors; cosine estimation; cosine similarity; encoding strategy; nearest neighbor search algorithms; project and sign; random directions; Binary codes; Databases; Encoding; Estimation; Hamming distance; Quantization (signal); Vectors; Hamming embedding; Locality sensitive hashing; approximate nearest neighbors; similarity search;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854934
Filename
6854934
Link To Document