Title :
A New Decision Tree Algorithm Based on Rough Set Theory
Author :
Ding, Baoshi ; Zheng, Yongqing ; Zang, Shaoyu
Author_Institution :
Dept. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
Abstract :
Decision tree algorithms have been widely used to classify data with numeric and categorical attributes. Many approaches have been proposed for inducing decision trees. ID3 (Quinlan, 1986), a heuristic algorithm, is a classic and popular method for decision tree induction. The key idea of ID3 is to use information gain as the criterion for selecting test attributes. In this paper, we propose a novel measure based on rough set theory for selecting the attributes that best split the current samples into individual classes. From the viewpoint of rough set theory, we analyze the shortcomings of the ID3 algorithm and the rationality of the new approach, and then propose a revised algorithm based on the original idea. The results of an example and of experiments show that our approach selects nodes for inducing decision trees better than ID3.
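For readers unfamiliar with the baseline, the following is a minimal Python sketch of the standard ID3 selection criterion (information gain) that the abstract refers to. It does not reproduce the rough-set measure proposed in the paper, which the abstract does not define. The function names, the toy data set, and its attribute names (`outlook`, `windy`, `play`) are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Entropy of the target minus the weighted entropy after splitting on attribute."""
    base = entropy([row[target] for row in rows])
    groups = {}
    for row in rows:
        # Collect the target labels of the rows sharing each attribute value.
        groups.setdefault(row[attribute], []).append(row[target])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder


def best_attribute(rows, attributes, target):
    """ID3-style choice: the attribute with the largest information gain."""
    return max(attributes, key=lambda a: information_gain(rows, a, target))


# Hypothetical toy data, not taken from the paper.
rows = [
    {"outlook": "sunny", "windy": "no", "play": "no"},
    {"outlook": "sunny", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "no", "play": "yes"},
    {"outlook": "rainy", "windy": "yes", "play": "yes"},
]

chosen = best_attribute(rows, ["outlook", "windy"], "play")
```

In this toy set, `outlook` separates the two classes completely while `windy` carries no information about `play`, so an ID3-style learner would split on `outlook` first.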
Keywords :
data mining; decision trees; heuristic programming; rough set theory; ID3; attribute reduction; categorical attribute classification; decision tree algorithm; heuristic algorithm; information gain; Classification algorithms; Classification tree analysis; Computer science; Current measurement; Information processing; Set theory; Testing; Classification; Rough Sets
Conference_Title :
Information Processing, 2009. APCIP 2009. Asia-Pacific Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-0-7695-3699-6
DOI :
10.1109/APCIP.2009.216