• DocumentCode
    3752991
  • Title

    A comparison of data mining techniques in evaluating retail credit scoring using R programming

  • Author

    Dilmurat Zakirov;Aleksey Bondarev;Nodar Momtselidze

  • Author_Institution
    Information Technologies Department, Demir Kyrgyz International Bank, Bishkek, Kyrgyzstan
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Retail credit scoring has become more efficient in recent years because of the use of data mining techniques that allow marketing officers and top managers to better estimate their customers credibility. In recent years, many complicated models have been developed; however there are few of them which continues to be used because of its efficiency and simplicity. This study investigates k-Nearest Neighbourhood (kNN), support vector machines (SVMs), gradient boosted model (GBM), Naive Bayes classification, Classification and Regression Tree (CART) and Random Forest (RF) as analytical methods for customer credit scoring estimation and evaluation, using real dataset. At the end of the study it is found that Random Forest model with down-sampling (RF_US) has better accuracy rate when compared to other models.
  • Keywords
    "Data mining","Support vector machines","Vegetation","Boosting","Computational modeling","Predictive models","Regression tree analysis"
  • Publisher
    ieee
  • Conference_Titel
    Electronics Computer and Computation (ICECCO), 2015 Twelve International Conference on
  • Type

    conf

  • DOI
    10.1109/ICECCO.2015.7416867
  • Filename
    7416867