DocumentCode :
2858414
Title :
Data pre-processing by genetic algorithms for bankruptcy prediction
Author :
Tsai, Chih-Fong ; Chou, Jui-Sheng
Author_Institution :
Dept. of Inf. Manage., Nat. Central Univ., Jhongli, Taiwan
fYear :
2011
fDate :
6-9 Dec. 2011
Firstpage :
1780
Lastpage :
1783
Abstract :
Bankruptcy prediction has been approached by data mining techniques. However, since data pre-processing including feature selection or dimensionality reduction and data reduction is a very important stage for successful data mining, very few consider performing both tasks to examine the impact of data pre-processing on prediction performance. This paper applies genetic algorithms, which have been widely used for the data pre-processing tasks, for feature selection and data reduction over a public bankruptcy prediction dataset. In particular, the experiments based on different priorities of performing feature selection and data reduction are conducted. The results show that performing data reduction only can allow the support vector machine (SVM) classifier to provide the highest rate of prediction accuracy. However, executing both feature selection and data reduction with different priorities performs the same. They not only largely reduce the dataset size, but also keep the similar performance as SVM without data pre-processing.
Keywords :
data mining; financial management; genetic algorithms; pattern classification; support vector machines; data mining techniques; data preprocessing; data reduction; dimensionality reduction; feature selection; genetic algorithms; public bankruptcy prediction dataset; support vector machine classifier; Accuracy; Classification algorithms; Data mining; Genetic algorithms; Machine learning; Support vector machines; Training; Bankruptcy prediction; data mining; data pre-processing genetic algorithms; data reduction; feature selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial Engineering and Engineering Management (IEEM), 2011 IEEE International Conference on
Conference_Location :
Singapore
ISSN :
2157-3611
Print_ISBN :
978-1-4577-0740-7
Electronic_ISBN :
2157-3611
Type :
conf
DOI :
10.1109/IEEM.2011.6118222
Filename :
6118222
Link To Document :
بازگشت