Title :
Attribute selection´s impact on robustness of decision trees
Author :
Wang, Jin-Feng ; Wang, Xi-Zhao ; Ha, Ming-Hu
Author_Institution :
Sch. of Math. & Comput. Sci., Hebei Univ., China
Abstract :
Most heuristic algorithms for building decision trees are based on the entropy of information. In this article, we introduce a new heuristic algorithm for decision tree generation based on the importance of attribute contributing to the classification, and apply the algorithm to several crisp databases. When the expanded attribute is selected in a specified node, we may have two choices, i.e., sensitive and insensitive attribute. Usually the sensitive attribute is selected for branching the node, but the insensitive attribute is ignored. We compare the two methods from robustness aspects by conducting experiments on several databases, in which the ID3´s robustness is also included. The result indicates that the insensitive method is the most robust one.
Keywords :
database management systems; decision trees; knowledge based systems; optimisation; ID3 algorithm; attribute ranking; attribute selection impact; databases; decision trees; heuristic algorithm; knowledge-based systems; node branching; robustness; sensitive attribute; Data mining; Decision trees; Entropy; Heuristic algorithms; Machine learning; Machine learning algorithms; Production facilities; Robustness; Spatial databases; Uncertainty;
Conference_Titel :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
DOI :
10.1109/ICMLC.2002.1175356