• DocumentCode
    481677
  • Title

    Cluster Reduction Support Vector Machine for Large-Scale Data Set Classification

  • Author

    Chen, Guangxi ; Cheng, Yan ; Xu, Jian

  • Author_Institution
    Sch. of Math. & Comput. Sci., Guilin Univ. of Electron. Technol., Guilin
  • Volume
    1
  • fYear
    2008
  • fDate
    19-20 Dec. 2008
  • Firstpage
    8
  • Lastpage
    12
  • Abstract
    Support vector machine (SVM) has been a promising method for data mining and machine learning in recent years. However, the training complexity of SVM is highly dependent on the size of a data set. A cluster support vector machines (C-SVM) method for large-scale data set classification is presented to accelerate the training speed. By calculating cluster mirror radius ratio and representative sample selection in each cluster, the original training data set can be reduced remarkably without losing the classification information. The new method can provide an SVM with high quality samples in lower time consuming. Experiments with random data and UCI databases show that the C-SVM retains the high quality of training data set and the classification accuracy in data mining.
  • Keywords
    data mining; learning (artificial intelligence); pattern classification; pattern clustering; support vector machines; SVM training; UCI database; cluster reduction support vector machine; data mining; large-scale data set classification; machine learning; sample selection; Conferences; Data mining; Kernel; Large-scale systems; Machine learning; Mathematics; Quadratic programming; Support vector machine classification; Support vector machines; Training data; Cluster reduction; Data mining; Support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Industrial Application, 2008. PACIIA '08. Pacific-Asia Workshop on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3490-9
  • Type

    conf

  • DOI
    10.1109/PACIIA.2008.43
  • Filename
    4756514