DocumentCode :
15579
Title :
Scalable Similarity Search With Topology Preserving Hashing
Author :
Lei Zhang ; Yongdong Zhang ; Xiaoguang Gu ; Jinhui Tang ; Qi Tian
Author_Institution :
Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
Volume :
23
Issue :
7
fYear :
2014
fDate :
Jul-14
Firstpage :
3025
Lastpage :
3039
Abstract :
Hashing-based similarity search techniques is becoming increasingly popular in large data sets. To capture meaningful neighbors, the topology of a data set, which represents the neighborhood relationships between its subregions and the relative proximities between the neighbors of each subregion, e.g., the relative neighborhood ranking of each subregion, should be exploited. However, most existing hashing methods are developed to preserve neighborhood relationships while ignoring the relative neighborhood proximities. Moreover, most hashing methods lack in providing a good result ranking, since there are often lots of results sharing the same Hamming distance to a query. In this paper, we propose a novel hashing method to solve these two issues jointly. The proposed method is referred to as topology preserving hashing (TPH). TPH is distinct from prior works by also preserving the neighborhood ranking. Based on this framework, we present three different TPH methods, including linear unsupervised TPH, semisupervised TPH, and kernelized TPH. Particularly, our unsupervised TPH is capable of mining semantic relationship between unlabeled data without supervised information. Extensive experiments on four large data sets demonstrate the superior performances of the proposed methods over several state-of-the-art unsupervised and semisupervised hashing techniques.
Keywords :
cryptography; file organisation; Hamming distance; data set topology; hashing-based similarity search techniques; large data sets; scalable similarity search; topology preserving hashing; Hamming distance; Kernel; Manifolds; Measurement; Optimization; Semantics; Topology; Similarity search; approximate nearest neighbor search; binary hashing; topology preserving hashing;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2014.2326010
Filename :
6819420
Link To Document :
بازگشت