• DocumentCode
    3058953
  • Title

    Large Scale Classification with Support Vector Machine Algorithms

  • Author

    Do, Thanh-Nghi ; Fekete, Jean-Daniel

  • Author_Institution
    Univ. Paris-Sud, Paris
  • fYear
    2007
  • fDate
    13-15 Dec. 2007
  • Firstpage
    7
  • Lastpage
    12
  • Abstract
    Boosting of least-squares support vector machine (LS-SVM) algorithms can classify large datasets on standard personal computers (PCs). We extend the LS-SVM proposed by Suykens and Vandewalle in several ways to efficiently classify large datasets. We developed a row-incremental version for datasets with billions of data points and up to 10,000 dimensions. By adding a Tikhonov regularization term and using the Sherman-Morrison-Woodbury formula, we developed a column-incremental LS-SVM to process datasets with a small number of data points but very high dimensionality. Finally, by applying boosting to these incremental LS-SVM algorithms, we developed classification algorithms for massive, very-high-dimensional datasets, and we also applied these ideas to build boosting of other efficient SVM algorithms proposed by Mangasarian, including Lagrange SVM (LSVM), proximal SVM (PSVM) and Newton SVM (NSVM). Numerical test results on UCI, RCV1- binary, Reuters-21578, Forest cover type and KDD cup 1999 datasets showed that our algorithms are often significantly faster and/or more accurate than state-of- the-art algorithms LibSVM, SVM-perf and CB-SVM.
  • Keywords
    least squares approximations; support vector machines; very large databases; Sherman-Morrison-Woodbury formula; Tikhonov regularization term; column-incremental LS-SVM; large datasets; large scale classification; least-squares support vector machine; row-incremental version; Boosting; Classification algorithms; Lagrangian functions; Large-scale systems; Machine learning; Machine learning algorithms; Microcomputers; Personal communication networks; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
  • Conference_Location
    Cincinnati, OH
  • Print_ISBN
    978-0-7695-3069-7
  • Type

    conf

  • DOI
    10.1109/ICMLA.2007.25
  • Filename
    4457200