DocumentCode :
2772100
Title :
P-packSVM: Parallel Primal grAdient desCent Kernel SVM
Author :
Zhu, Zeyuan Allen ; Chen, Weizhu ; Wang, Gang ; Zhu, Chenguang ; Chen, Zheng
Author_Institution :
Dept. of Phys., Tsinghua Univ., Beijing, China
fYear :
2009
fDate :
6-9 Dec. 2009
Firstpage :
677
Lastpage :
686
Abstract :
It is an extreme challenge to produce a nonlinear SVM classifier on very large scale data. In this paper we describe a novel P-packSVM algorithm that can solve the support vector machine (SVM) optimization problem with an arbitrary kernel. This algorithm embraces the best known stochastic gradient descent method to optimize the primal objective, and has 1/¿ dependency in complexity to obtain a solution of optimization error ¿. The algorithm can be highly parallelized with a special packing strategy, and experiences sub-linear speed-up with hundreds of processors. We demonstrate that P-packSVM achieves accuracy sufficiently close to that of SVM-light, and overwhelms the state-of-the-art parallel SVM trainer PSVM in both accuracy and efficiency. As an illustration, our algorithm trains CCAT dataset with 800 k samples in 13 minutes and 95% accuracy, while PSVM needs 5 hours but only has 92% accuracy. We at last demonstrate the capability of P-packSVM on 8 million training samples.
Keywords :
gradient methods; parallel processing; pattern classification; stochastic processes; support vector machines; CCAT dataset; P-packSVM algorithm; PSVM; SVM-light; arbitrary kernel; nonlinear SVM classifier; parallel primal gradient descent kernel SVM; special packing strategy; state-of-the-art parallel SVM trainer; stochastic gradient descent method; sublinear speed-up; support vector machine optimization; very large scale data; Asia; Computer science; Costs; Data mining; Kernel; Optimization methods; Physics; Stochastic processes; Support vector machine classification; Support vector machines; kernel; packing strategy; parallel; stochastic gradient descent; support vector machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
ISSN :
1550-4786
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2009.29
Filename :
5360294
Link To Document :
بازگشت