Title :
Semi-supervised hashing for scalable image retrieval
Author :
Wang, Jun ; Kumar, Sanjiv ; Chang, Shih-Fu
Author_Institution :
Columbia Univ. New York, Columbia, NY, USA
Abstract :
Large scale image search has recently attracted considerable attention due to easy availability of huge amounts of data. Several hashing methods have been proposed to allow approximate but highly efficient search. Unsupervised hashing methods show good performance with metric distances but, in image search, semantic similarity is usually given in terms of labeled pairs of images. There exist supervised hashing methods that can handle such semantic similarity but they are prone to overfitting when labeled data is small or noisy. Moreover, these methods are usually very slow to train. In this work, we propose a semi-supervised hashing method that is formulated as minimizing empirical error on the labeled data while maximizing variance and independence of hash bits over the labeled and unlabeled data. The proposed method can handle both metric as well as semantic similarity. The experimental results on two large datasets (up to one million samples) demonstrate its superior performance over state-of-the-art supervised and unsupervised methods.
Keywords :
file organisation; image retrieval; large scale image search; scalable image retrieval; semantic similarity; semi-supervised hashing method; unsupervised hashing method; Availability; Binary codes; Image databases; Image retrieval; Information retrieval; Large-scale systems; Nearest neighbor searches; Scalability; Videos; YouTube;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4244-6984-0
DOI :
10.1109/CVPR.2010.5539994