• DocumentCode
    1831493
  • Title

    Diversity of feature selection approaches combined with distinct classifiers

  • Author

    Li Feng-Chia ; Wang Peng-Kai ; Yeh Li-Lon

  • Author_Institution
    Dept. of Inf. Manage., Jen Teh Junior Coll., Miaoli, Taiwan
  • fYear
    2010
  • fDate
    7-10 Dec. 2010
  • Firstpage
    28
  • Lastpage
    32
  • Abstract
    The credit scoring has been regarded as a critical topic and its related departments make efforts to collect huge amount of data to avoid wrong decision. An effective classificatory model will objectively help managers instead of intuitive experience. This study proposes five approaches combining with the back-propagation neural network (BPN) classifier for features selection that retains sufficient information for classification purpose. Different credit scoring models are constructed by selecting attributes with five approaches. Two UCI (University of California, Irvine) data sets are chosen to evaluate the accuracy of various hybrid-BPN models. BPN classifier combines with conventional statistical LDA, Decision tree, Rough sets theory, F-score and Gray relation approaches as features preprocessing step to optimize feature space by removing both irrelevant and redundant features. In this paper, the procedure of the proposed approaches will be described and then evaluated by their performances. The results are compared in combination with BPN classifier and nonparametric Wilcoxon signed rank test will be held to show if there is any significant difference between these models. The result in this study suggests that hybrid credit scoring approach is mostly robust and effective in finding optimal subsets and is a promising method to the fields of data mining.
  • Keywords
    backpropagation; decision trees; neural nets; pattern classification; rough set theory; statistical analysis; F-score; backpropagation neural network classifier; decision tree; distinct classifiers; feature selection approaches; gray relation approaches; nonparametric Wilcoxon signed rank test; rough sets theory; statistical LDA; Accuracy; Classification algorithms; Computational modeling; Data mining; Data models; Decision trees; Rough sets; Back-propagation neural network; Decision tree; F-score; Gray relational analysis; Linear discriminate analysis; Rough sets theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Engineering and Engineering Management (IEEM), 2010 IEEE International Conference on
  • Conference_Location
    Macao
  • ISSN
    2157-3611
  • Print_ISBN
    978-1-4244-8501-7
  • Electronic_ISBN
    2157-3611
  • Type

    conf

  • DOI
    10.1109/IEEM.2010.5674600
  • Filename
    5674600