Title :
A New Approach for Decision Tree Based on Principal Component Analysis
Author :
Hu, Juanli ; Deng, Jiabin ; Sui, Mingxiang
Author_Institution :
Dept. of Comput. Eng., Zhongshan Polytech., Zhongshan, China
Abstract :
Classification algorithm has always been a hot issue in data mining. Decision tree algorithm is the most active part in this area, but it is a NP problem to construct the optimization decision tree. With the development of the information collection technology, the requirements of the mass data mining have become increasingly higher. When dealing with large, continuous, even with the noise and abnormal data, the traditional decision tree algorithm seems very incompetent, encountering the efficiency of the bottleneck and classification error. In this paper, there exist the shortcomings for the decision tree algorithm to deal with multi-attribute data sources. The multivariate statistical methods is proposed to make the principal component analysis on multi-attribute data, reducing dimensionality, devoicing processing and transforming the traditional decision tree algorithm to form a new algorithm model. Comparing with the traditional decision tree algorithm, the experimental results show that this method can not only simplify the decision tree model, but also can improve prediction accuracy of the decision tree.
Keywords :
data mining; decision trees; pattern classification; principal component analysis; NP problem; classification algorithm; data mining; decision tree; devoicing processing; dimensionality reduction; information collection technology; multiattribute data sources; multivariate statistical method; optimization; prediction accuracy; principal component analysis; Classification tree analysis; Computer errors; Data engineering; Data mining; Decision trees; Predictive models; Principal component analysis; Statistical analysis; Testing; Tree data structures;
Conference_Titel :
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4507-3
Electronic_ISBN :
978-1-4244-4507-3
DOI :
10.1109/CISE.2009.5366006