Title :
Constructing Decision Tree by Integrating Multiple Information Metrics
Author :
Chen, Guang-Hua ; Wang, Zheng-Qun ; Yu, Zhen-Zhou
Author_Institution :
Sch. of Inf. Eng., Yangzhou Univ., Yangzhou, China
Abstract :
In this paper, a new decision tree construction algorithm (MIDT) is proposed. MIDT (Multiple Informative Decision Tree) uses principal component analysis to integrate information gain, samples distribution information and correlation coefficient as the basis of the selection of splitting attributes. This method can overcome the disadvantage of ID3 decision tree construction method that uses information gain as the splitting attributes selection criteria as a result of its tendency to select the attribute with more values. And moreover, it can exert the complementarity between decision of entropy mean and decision of samples distribution.The results of experiments on the standard data sets provided by UCI show that the decision tree constructed by MIDT has higher classification accuracy and is more stable than ID3 and parametric estimation decision tree algorithm.
Keywords :
decision trees; entropy; parameter estimation; pattern classification; principal component analysis; ID3 decision tree construction method; classification accuracy; correlation coefficient; decision tree algorithm; decision tree construction algorithm; distribution information; entropy mean; information gain; multiple information metrics; multiple informative decision tree; parametric estimation; principal component analysis; samples distribution; splitting attributes; standard data sets; Classification tree analysis; Decision trees; Entropy; Principal component analysis;
Conference_Titel :
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4199-0
DOI :
10.1109/CCPR.2009.5344133