• DocumentCode
    2860489
  • Title

    A New Approach for Decision Tree Based on Principal Component Analysis

  • Author

    Hu, Juanli ; Deng, Jiabin ; Sui, Mingxiang

  • Author_Institution
    Dept. of Comput. Eng., Zhongshan Polytech., Zhongshan, China
  • fYear
    2009
  • fDate
    11-13 Dec. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Classification algorithm has always been a hot issue in data mining. Decision tree algorithm is the most active part in this area, but it is a NP problem to construct the optimization decision tree. With the development of the information collection technology, the requirements of the mass data mining have become increasingly higher. When dealing with large, continuous, even with the noise and abnormal data, the traditional decision tree algorithm seems very incompetent, encountering the efficiency of the bottleneck and classification error. In this paper, there exist the shortcomings for the decision tree algorithm to deal with multi-attribute data sources. The multivariate statistical methods is proposed to make the principal component analysis on multi-attribute data, reducing dimensionality, devoicing processing and transforming the traditional decision tree algorithm to form a new algorithm model. Comparing with the traditional decision tree algorithm, the experimental results show that this method can not only simplify the decision tree model, but also can improve prediction accuracy of the decision tree.
  • Keywords
    data mining; decision trees; pattern classification; principal component analysis; NP problem; classification algorithm; data mining; decision tree; devoicing processing; dimensionality reduction; information collection technology; multiattribute data sources; multivariate statistical method; optimization; prediction accuracy; principal component analysis; Classification tree analysis; Computer errors; Data engineering; Data mining; Decision trees; Predictive models; Principal component analysis; Statistical analysis; Testing; Tree data structures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4507-3
  • Electronic_ISBN
    978-1-4244-4507-3
  • Type

    conf

  • DOI
    10.1109/CISE.2009.5366006
  • Filename
    5366006