DocumentCode :
1462028
Title :
A new criterion in selection and discretization of attributes for the generation of decision trees
Author :
Jun, Byung Hwan ; Kim, Chang Soo ; Song, Hong-Yeop ; Kim, Jaihie
Author_Institution :
Dept. of Comput. Sci., Kongju Nat. Univ., Chungnam, South Korea
Volume :
19
Issue :
12
fYear :
1997
fDate :
12/1/1997 12:00:00 AM
Firstpage :
1371
Lastpage :
1375
Abstract :
It is important to use a better criterion in selection and discretization of attributes for the generation of decision trees to construct a better classifier in the area of pattern recognition in order to intelligently access huge amount of data efficiently. Two well-known criteria are gain and gain ratio, both based on the entropy of partitions. We propose in this paper a new criterion based also on entropy, and use both theoretical analysis and computer simulation to demonstrate that it works better than gain or gain ratio in a wide variety of situations. We use the usual entropy calculation where the base of the logarithm is not two but the number of successors to the node. Our theoretical analysis leads some specific situations in which the new criterion works always better than gain or gain ratio, and the simulation result may implicitly cover all the other situations not covered by the analysis
Keywords :
decision theory; entropy; pattern classification; trees (mathematics); attribute discretization; attribute selection; classifier; decision tree generation; partition entropy; pattern recognition; Analytical models; Classification tree analysis; Computer Society; Computer simulation; Decision trees; Entropy; Error analysis; Gain measurement; Machine learning; Pattern recognition;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/34.643896
Filename :
643896
Link To Document :
بازگشت