• DocumentCode
    1805803
  • Title

    Could Decision Trees Improve the Classification Accuracy and Interpretability of Loan Granting Decisions?

  • Author

    Zurada, Jozef

  • Author_Institution
    Dept. of Comput. Inf. Syst., Univ. of Louisville, Louisville, CO, USA
  • fYear
    2010
  • fDate
    5-8 Jan. 2010
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    The paper compares the classification performance rate of eight models: logistic regression (LR), neural network (NN), radial basis function neural network (RBFNN), support vector machine (SVM), case-base reasoning (CBR), and three decision trees (DTs). We build models and test their classification accuracy rates on a historical data set provided by a German financial institution. The data set contains 21 financial attributes of 1000 customers. Though at the time of loan application all individuals deemed to the institution to be qualified to obtain a loan, 300 of them defaulted upon a loan and 700 paid it off. To obtain reliable and unbiased error estimates for each of the eight models we apply 10-fold cross-validation and repeat an experiment 10 times. We found that in the overall classification accuracy rates at 0.5 probability cut-off, two of the three DT models significantly outperformed (at alpha=0.05) the other remaining models. We then concentrate our attention on DT models and compare their performance at 0.3 and 0.7 cut-off levels which are more likely to be used by financial institutions. The DT models not only classify better than the other models, but the knowledge they learn in the form of if-then rules is easy to interpret, makes sense, and might be of value to financial institutions which may have to explain the reasons for a loan denial.
  • Keywords
    case-based reasoning; decision trees; financial management; radial basis function networks; regression analysis; support vector machines; German financial institution; case-base reasoning; decision trees; historical data set; loan granting decisions; logistic regression; neural network; support vector machine; Classification tree analysis; Credit cards; Decision trees; Fuzzy sets; Logistics; Neural networks; Radial basis function networks; Regression tree analysis; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Sciences (HICSS), 2010 43rd Hawaii International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1530-1605
  • Print_ISBN
    978-1-4244-5509-6
  • Electronic_ISBN
    1530-1605
  • Type

    conf

  • DOI
    10.1109/HICSS.2010.124
  • Filename
    5428636