DocumentCode
2543606
Title
Constructing Decision Tree by Integrating Multiple Information Metrics
Author
Chen, Guang-Hua ; Wang, Zheng-Qun ; Yu, Zhen-Zhou
Author_Institution
Sch. of Inf. Eng., Yangzhou Univ., Yangzhou, China
fYear
2009
fDate
4-6 Nov. 2009
Firstpage
1
Lastpage
5
Abstract
In this paper, a new decision tree construction algorithm (MIDT) is proposed. MIDT (Multiple Informative Decision Tree) uses principal component analysis to integrate information gain, samples distribution information and correlation coefficient as the basis of the selection of splitting attributes. This method can overcome the disadvantage of ID3 decision tree construction method that uses information gain as the splitting attributes selection criteria as a result of its tendency to select the attribute with more values. And moreover, it can exert the complementarity between decision of entropy mean and decision of samples distribution.The results of experiments on the standard data sets provided by UCI show that the decision tree constructed by MIDT has higher classification accuracy and is more stable than ID3 and parametric estimation decision tree algorithm.
Keywords
decision trees; entropy; parameter estimation; pattern classification; principal component analysis; ID3 decision tree construction method; classification accuracy; correlation coefficient; decision tree algorithm; decision tree construction algorithm; distribution information; entropy mean; information gain; multiple information metrics; multiple informative decision tree; parametric estimation; principal component analysis; samples distribution; splitting attributes; standard data sets; Classification tree analysis; Decision trees; Entropy; Principal component analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4244-4199-0
Type
conf
DOI
10.1109/CCPR.2009.5344133
Filename
5344133
Link To Document