• DocumentCode
    3285266
  • Title

    Neighborhood Preprocessing SVM for Large-Scale Data Sets Classification

  • Author

    Chen, Guangxi ; Xu, Jian ; Xiang, Xiaolin

  • Author_Institution
    Sch. of Math. & Comput. Sci., Guilin Univ. of Electron. Technol., Guilin
  • Volume
    2
  • fYear
    2008
  • fDate
    18-20 Oct. 2008
  • Firstpage
    245
  • Lastpage
    249
  • Abstract
    Support vector machine (SVM) has been a promising method for data mining and machine learning in recent years. However, the training complexity of SVM is highly dependent on the size of a data set. A preprocessing support vector machines (P-SVM) method for large-scale data set classification is presented to speed up SVM training. By analyzing the neighbor classification feature for each sample in training data set, a decision criterion was built to keep or delete this sample from the original data set without losing the classification. The new method can provide an SVM with high quality samples. Experiments with random data and UCI databases show that SVM with our new preprocessing method retains the high quality of training data set and the classification accuracy very well.
  • Keywords
    classification; data mining; learning (artificial intelligence); support vector machines; data mining; large-scale data sets classification; machine learning; neighbor classification feature; neighborhood preprocessing SVM; support vector machine; Data mining; Fuzzy systems; Kernel; Large-scale systems; Machine learning; Mathematics; Quadratic programming; Support vector machine classification; Support vector machines; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
  • Conference_Location
    Shandong
  • Print_ISBN
    978-0-7695-3305-6
  • Type

    conf

  • DOI
    10.1109/FSKD.2008.94
  • Filename
    4666116