Title :
A Post-Pruning Decision Tree Algorithm Based on Bayesian
Author :
Wenchao Zhang ; Yafen Li
Author_Institution :
Sch. of Electron. Inf. & Control Eng., Beijing Univ. of Technol., Beijing, China
Abstract :
The C4.5 Algorithm can result in a thriving decision tree and will overfit the training data while training the model. In order to overcome those disadvantages, this paper proposed a post-pruning decision tree algorithm based on Bayesian theory, in which each branch of the decision tree generated by the C4.5 algorithm is validated by Bayesian theorem, and then those branches that do not meet the conditions will be removed from the decision tree, at last a simple decision tree will be generated. The proposed algorithm can be verified by the data provided by the Beijing key disciplines platform and the Beijing Master and Dr. Platform. The result shows that the algorithm can the most unreliable and uneven branches. And compared with the C4.5 algorithm, the proposed algorithm has a higher prediction accuracy and a broader coverage.
Keywords :
belief networks; data mining; decision trees; Bayesian theory; Beijing Master and Dr. platform; Beijing key disciplines platform; C4.5 algorithm; post-pruning decision tree algorithm; Accuracy; Bayes methods; Classification algorithms; Decision trees; Medical services; Prediction algorithms; Training; Bayesian theory; Data Mining; Decision tree; post-pruning;
Conference_Titel :
Computational and Information Sciences (ICCIS), 2013 Fifth International Conference on
Conference_Location :
Shiyang
DOI :
10.1109/ICCIS.2013.265