Title :
NESVM: A Fast Gradient Method for Support Vector Machines
Author :
Zhou, Tianyi; Tao, Dacheng; Wu, Xindong
Author_Institution :
Centre for Quantum Comput. & Intell. Syst., Univ. of Technol. Sydney, Sydney, NSW, Australia
Abstract :
Support vector machines (SVMs) are invaluable tools for many practical applications in artificial intelligence, e.g., classification and event recognition. However, popular SVM solvers are not sufficiently efficient for applications with a large number of samples and a large number of features. In this paper, we therefore present NESVM, a fast gradient SVM solver that can optimize various SVM models, e.g., the classical SVM, the linear programming SVM, and the least squares SVM. Compared against SVM-Perf (whose convergence rate in solving the dual SVM is upper bounded by O(1/√k), where k is the number of iterations) and Pegasos (an online SVM solver that converges at rate O(1/k) for the primal SVM), NESVM achieves the optimal convergence rate of O(1/k²) with linear time complexity. In particular, NESVM smooths the nondifferentiable hinge loss and ℓ1-norm in the primal SVM, and then applies the optimal gradient method, which requires no line search, to solve the resulting optimization. In each iteration round, the current gradient and the historical gradients are combined to determine the descent direction, while the Lipschitz constant determines the step size. Only two matrix-vector multiplications are required per iteration, so NESVM is more efficient than existing SVM solvers. In addition, NESVM supports both linear and nonlinear kernels. We also propose "homotopy NESVM", which accelerates NESVM by dynamically decreasing the smoothing parameter via the continuation method. Our experiments on census income categorization, indoor/outdoor scene classification, event recognition, and scene recognition demonstrate the efficiency and effectiveness of NESVM. The MATLAB code of NESVM will be available on our website for further assessment.
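The abstract's recipe (smooth the hinge loss, run the optimal gradient method with a 1/L step and a momentum term that blends the current and historical gradients, and optionally wrap this in a continuation loop that shrinks the smoothing parameter) can be illustrated with a short sketch. The NumPy code below is a minimal illustration under stated assumptions, not the paper's exact update: the function names (smoothed_hinge_grad, nesvm_sketch, homotopy_sketch) and hyperparameter defaults are invented for this example, Nesterov's smoothing of the hinge loss is one standard choice, and the textbook two-sequence accelerated scheme stands in for NESVM's specific iteration.

import numpy as np

def smoothed_hinge_grad(w, X, y, lam, mu):
    # Nesterov-smoothed hinge: h_mu(z) = max_{0<=u<=1} u*(1 - z) - (mu/2)*u^2.
    # Its maximizer u* = clip((1 - z)/mu, 0, 1) equals -dh_mu/dz, so the
    # gradient needs exactly two matrix-vector products per call.
    margins = y * (X @ w)                       # mat-vec 1: X w
    u = np.clip((1.0 - margins) / mu, 0.0, 1.0)
    return lam * w - X.T @ (u * y) / len(y)     # mat-vec 2: X^T (u .* y)

def nesvm_sketch(X, y, lam=1e-2, mu=0.1, iters=500, w0=None):
    # Constant-step accelerated gradient (no line search) on
    #   f(w) = (lam/2)||w||^2 + (1/n) sum_i h_mu(y_i x_i^T w).
    n, d = X.shape
    # Gradient Lipschitz constant: lam + ||X||_2^2 / (n*mu), since the
    # smoothed hinge has a (1/mu)-Lipschitz derivative.
    L = lam + np.linalg.norm(X, 2) ** 2 / (n * mu)
    w = np.zeros(d) if w0 is None else w0.copy()
    v, t = w.copy(), 1.0
    for _ in range(iters):
        w_next = v - smoothed_hinge_grad(v, X, y, lam, mu) / L
        t_next = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        v = w_next + ((t - 1.0) / t_next) * (w_next - w)  # folds in past gradients
        w, t = w_next, t_next
    return w

def homotopy_sketch(X, y, lam=1e-2, mu0=1.0, shrink=0.5, stages=5):
    # Continuation: solve with a large smoothing parameter, shrink it,
    # and warm-start the next stage from the previous solution.
    w, mu = None, mu0
    for _ in range(stages):
        w = nesvm_sketch(X, y, lam=lam, mu=mu, w0=w)
        mu *= shrink
    return w

A kernelized variant would apply the same iteration to the coefficient vector of the kernel expansion; the structure of the update, and the two matrix-vector products per iteration, are unchanged.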
Keywords :
computational complexity; gradient methods; iterative methods; optimisation; support vector machines; NESVM; Nesterov's method; continuation method; convergence rate; hinge loss; iteration round; linear time complexity; matrix-vector multiplication; ℓ1-norm; smoothing
Conference_Title :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-9131-5
ISSN :
1550-4786
DOI :
10.1109/ICDM.2010.135