DocumentCode
3120545
Title
Attribute selection´s impact on robustness of decision trees
Author
Wang, Jin-Feng ; Wang, Xi-Zhao ; Ha, Ming-Hu
Author_Institution
Sch. of Math. & Comput. Sci., Hebei Univ., China
Volume
4
fYear
2002
fDate
4-5 Nov. 2002
Firstpage
1829
Abstract
Most heuristic algorithms for building decision trees are based on the entropy of information. In this article, we introduce a new heuristic algorithm for decision tree generation based on the importance of attribute contributing to the classification, and apply the algorithm to several crisp databases. When the expanded attribute is selected in a specified node, we may have two choices, i.e., sensitive and insensitive attribute. Usually the sensitive attribute is selected for branching the node, but the insensitive attribute is ignored. We compare the two methods from robustness aspects by conducting experiments on several databases, in which the ID3´s robustness is also included. The result indicates that the insensitive method is the most robust one.
Keywords
database management systems; decision trees; knowledge based systems; optimisation; ID3 algorithm; attribute ranking; attribute selection impact; databases; decision trees; heuristic algorithm; knowledge-based systems; node branching; robustness; sensitive attribute; Data mining; Decision trees; Entropy; Heuristic algorithms; Machine learning; Machine learning algorithms; Production facilities; Robustness; Spatial databases; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN
0-7803-7508-4
Type
conf
DOI
10.1109/ICMLC.2002.1175356
Filename
1175356
Link To Document