DocumentCode
476748
Title
Comparing the knowledge quality in rough classifier and decision tree classifier
Author
Mohsin, Mohamad Farhan Mohamad ; Wahab, Mohd Helmy Abd
Author_Institution
College of Arts & Sciences, Universiti Utara Malaysia, 06010 UUM Sintok, Kedah, Malaysia
Volume
2
fYear
2008
fDate
26-28 Aug. 2008
Firstpage
1
Lastpage
6
Abstract
This paper presents a comparative study of two rule based classifier; rough set (Rc ) and decision tree (DTc ). Both techniques apply different approach to perform classification but produce same structure of output with comparable result. Theoretically, different classifiers will generate different sets of rules via knowledge even though they are implemented to the same classification problem. Hence, the aim of this paper is to investigate the quality of knowledge produced by Rc and DTc when similar problems are presented to them. In this case, four important performance metrics are used as comparison, the accuracy of classification, rules quantity, rules length and rules coverage. Five dataset from UCI Machine Learning are chosen and then mined using Rc toolkit namely ROSETTA while C4.5 algorithm in WEKA application is chosen as DTc rule generator. The experimental result shows that Rc and DTc own capability to generate quality knowledge since most of the results are comparable. Rc outperform as an accurate classifier, produce shorter and simpler rule with higher coverage. Meanwhile, DTc obviously generates fewer numbers of rules with significant difference.
Keywords
Art; Classification tree analysis; Data engineering; Data mining; Decision trees; Educational institutions; Machine learning; Machine learning algorithms; Measurement; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology, 2008. ITSim 2008. International Symposium on
Conference_Location
Kuala Lumpur, Malaysia
Print_ISBN
978-1-4244-2327-9
Electronic_ISBN
978-1-4244-2328-6
Type
conf
DOI
10.1109/ITSIM.2008.4631700
Filename
4631700
Link To Document