DocumentCode :
253749
Title :
Hash-SVM: Scalable Kernel Machines for Large-Scale Visual Classification
Author :
Yadong Mu ; Gang Hua ; Wei Fan ; Shih-Fu Chang
fYear :
2014
fDate :
23-28 June 2014
Firstpage :
979
Lastpage :
986
Abstract :
This paper presents a novel algorithm which uses compact hash bits to greatly improve the efficiency of non-linear kernel SVM in very large scale visual classification problems. Our key idea is to represent each sample with compact hash bits, over which an inner product is defined to serve as the surrogate of the original nonlinear kernels. Then the problem of solving the nonlinear SVM can be transformed into solving a linear SVM over the hash bits. The proposed Hash-SVM enjoys dramatic storage cost reduction owing to the compact binary representation, as well as a (sub-)linear training complexity via linear SVM. As a critical component of Hash-SVM, we propose a novel hashing scheme for arbitrary non-linear kernels via random subspace projection in reproducing kernel Hilbert space. Our comprehensive analysis reveals a well behaved theoretic bound of the deviation between the proposed hashing-based kernel approximation and the original kernel function. We also derive requirements on the hash bits for achieving a satisfactory accuracy level. Several experiments on large-scale visual classification benchmarks are conducted, including one with over 1 million images. The results show that Hash-SVM greatly reduces the computational complexity (more than ten times faster in many cases) while keeping comparable accuracies.
Keywords :
Hilbert spaces; computational complexity; cost reduction; file organisation; pattern classification; support vector machines; arbitrary nonlinear kernel; compact binary representation; compact hash bits; comprehensive analysis; computational complexity; hash-SVM; hashing scheme; hashing-based kernel approximation; kernel Hilbert space; kernel function; large-scale visual classification benchmark; linear SVM; nonlinear kernel SVM; original nonlinear kernel; random subspace projection; scalable kernel machines; storage cost reduction; sublinear training complexity; very large scale visual classification problem; Approximation methods; Equations; Kernel; Memory management; Support vector machines; Training; Vectors; Kernel SVM; Locality sensitive hashing; random subspace;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
Type :
conf
DOI :
10.1109/CVPR.2014.130
Filename :
6909525
Link To Document :
بازگشت