• DocumentCode
    3116916
  • Title

    Support Vector classification for large data sets by reducing training data with change of classes

  • Author

    Cervantes, Jair ; Li, XiaoOu ; Wen Yu

  • Author_Institution
    Dept. of Comput. Sci., CINVESTAV del I.P.N., Mexico City
  • fYear
    2008
  • fDate
    12-15 Oct. 2008
  • Firstpage
    2609
  • Lastpage
    2614
  • Abstract
    In recent years support vector machines (SVM) has received considerable attention due to its high generalization ability and performance for a wide range of applications. However, the most important problem of this method is slow training for classification problems with a large data sets because the quadratic form is completely dense and the memory requirements grow with the square of the number of data points. This paper presents a novel SVM classification approach for large data sets by reducing training data and train the support vector machine using only these data. In this algorithm, a first stage uses SVM classification on a small data set in order to gets a sketch of classes distribution and labels the support vectors as a data set with label +1 and the other points as a data set with label -1. We call this change of classes. Then the algorithm obtains the classification hyperplane and classify the original input data set, the data points obtained with label +1 constitute the data points in the boundary of each original class and represent the most important data points, these data points are used as training data for a posterior SVM classification. The effectiveness of the approach proposed is supported by experimental results.
  • Keywords
    data reduction; pattern classification; support vector machines; SVM classification; classification problems; support vector classification; training data reduction; Classification algorithms; Computational efficiency; Computer science; Kernel; Matrix decomposition; Probability distribution; Sampling methods; Support vector machine classification; Support vector machines; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
  • Conference_Location
    Singapore
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2383-5
  • Electronic_ISBN
    1062-922X
  • Type

    conf

  • DOI
    10.1109/ICSMC.2008.4811689
  • Filename
    4811689