• DocumentCode
    3265121
  • Title

    Feature Selection Based on SVM for Credit Scoring

  • Author

    Yao, Ping

  • Author_Institution
    Sch. of Econ. & Manage., Heilongjiang Inst. of Sci. & Technol., Harbin, China
  • Volume
    2
  • fYear
    2009
  • fDate
    6-7 June 2009
  • Firstpage
    44
  • Lastpage
    47
  • Abstract
    As the credit industry has been growing rapidly, huge number of consumerspsila credit data are collected by the credit department of the bank and credit scoring has become a very important issue. Usually, a large amount of redundant information and features are involved in the credit dataset, which leads to lower accuracy and higher complexity of the credit scoring model, so, effective feature selection methods are necessary for credit dataset with huge number of features. This paper aims at comparing seven well-known feature selection methods for credit scoring. Which are t-test, principle component analysis (PCA), factor analysis (FA), stepwise regression, rough set (RS), classification and regression tree (CART) and multivariate adaptive regression splines (MARS). Support vector machine (SVM) is used as the classification model. Two credit scoring databases are used in order to provide a reliable conclusion. Regarding the experimental results, the CART and MARS methods outperform the other methods by the overall accuracy and type I error and type II error.
  • Keywords
    banking; support vector machines; bank; classification and regression tree; credit scoring databases; credit scoring model; factor analysis; feature selection method; multivariate adaptive regression splines; principle component analysis; rough set theory; stepwise regression; support vector machine; Classification tree analysis; Computational intelligence; Costs; Mars; Mathematical model; Principal component analysis; Regression tree analysis; Sampling methods; Support vector machine classification; Support vector machines; classification and regression tree; credit scoring; factor analysis; feature selection; multivariate adaptive regression splines; principle component analysis; rough set; stepwise regression; support vector machine; t-test;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Natural Computing, 2009. CINC '09. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3645-3
  • Type

    conf

  • DOI
    10.1109/CINC.2009.36
  • Filename
    5231051