DocumentCode
169771
Title
Pruning with Majority and Minority Properties
Author
Hae Sook Jeon ; Won Don Lee
Author_Institution
IT Convergence Technol. Res. Lab., Electron. & Telecommun. Res. Inst., Daejeon, South Korea
fYear
2014
fDate
6-9 May 2014
Firstpage
1
Lastpage
4
Abstract
Classification is very imprtant research in knowledge discovery and machine learning. The decision tree is one of the well-known data mining methods. In general, a decision tree can be grown so as to have zero eeor on the training data set. If there is any noise in the data set or it does not completely cover the decision space, then over-fitting occurs and the tree needs to be pruned in order to accurately generalize the test data set. In this paper, we propose a pre-pruning method with majority and minority properties for the decision tree. It uses two kinds of qualifying criteria to consider whether the ration of the highest class of a subtree is the majority of the subtree or a minority of the overall tree. New measures for these are added to the classifier with the extended data expression. Experiments show that a clasifier using this pruning method can improve classification accuracy as well as reduce the size of the tree.
Keywords
data mining; decision trees; learning (artificial intelligence); pattern classification; classification accuracy; data expression; data mining methods; decision tree; knowledge discovery; machine learning; pattern classification; pre-pruning method; subtree; Accuracy; Decision trees; Equations; Error analysis; Gain measurement; Rain; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science and Applications (ICISA), 2014 International Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4799-4443-9
Type
conf
DOI
10.1109/ICISA.2014.6847450
Filename
6847450
Link To Document