• DocumentCode
    3415086
  • Title

    Sample Reduction for SVMs via Data Structure Analysis

  • Author

    Wang, Defeng ; Yeung, Daniel S. ; Eric, C.C.T.

  • Author_Institution
    Dept. of Comput., Hong Kong Polytech. Univ., Hong Kong
  • fYear
    2007
  • fDate
    16-18 April 2007
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents a new sample reduction algorithm, sample reduction by data structure analysis (SR-DSA), for SVMs to improve their scalability. SR-DSA utilizes data structure information in determining which data points are not useful in learning the separating plane and could be removed. As this algorithm is performed before SVMs training, it avoids the problem suffered by most sample reduction methods whose choices of samples heavily depend on repeatedly training of SVMs. Experiments on both synthetic and real world datasets have shown that SR-DSA is capable of reducing the number of samples as well as the time for SVMs training while maintaining high testing accuracy.
  • Keywords
    data structures; support vector machines; SVM; data structure; sample reduction; Data analysis; Data structures; Kernel; Probability distribution; Quadratic programming; Sampling methods; Scalability; Statistical learning; Support vector machines; Testing; Hierarchical Clustering; Mahalanobis distance; Sample Reduction; Support Vector Machines (SVMs);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System of Systems Engineering, 2007. SoSE '07. IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • Print_ISBN
    1-4244-1159-9
  • Electronic_ISBN
    1-4244-1160-2
  • Type

    conf

  • DOI
    10.1109/SYSOSE.2007.4304333
  • Filename
    4304333