• DocumentCode
    476748
  • Title

    Comparing the knowledge quality in rough classifier and decision tree classifier

  • Author

    Mohsin, Mohamad Farhan Mohamad ; Wahab, Mohd Helmy Abd

  • Author_Institution
    College of Arts & Sciences, Universiti Utara Malaysia, 06010 UUM Sintok, Kedah, Malaysia
  • Volume
    2
  • fYear
    2008
  • fDate
    26-28 Aug. 2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents a comparative study of two rule based classifier; rough set (Rc) and decision tree (DTc). Both techniques apply different approach to perform classification but produce same structure of output with comparable result. Theoretically, different classifiers will generate different sets of rules via knowledge even though they are implemented to the same classification problem. Hence, the aim of this paper is to investigate the quality of knowledge produced by Rc and DTc when similar problems are presented to them. In this case, four important performance metrics are used as comparison, the accuracy of classification, rules quantity, rules length and rules coverage. Five dataset from UCI Machine Learning are chosen and then mined using Rc toolkit namely ROSETTA while C4.5 algorithm in WEKA application is chosen as DTc rule generator. The experimental result shows that Rc and DTc own capability to generate quality knowledge since most of the results are comparable. Rc outperform as an accurate classifier, produce shorter and simpler rule with higher coverage. Meanwhile, DTc obviously generates fewer numbers of rules with significant difference.
  • Keywords
    Art; Classification tree analysis; Data engineering; Data mining; Decision trees; Educational institutions; Machine learning; Machine learning algorithms; Measurement; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology, 2008. ITSim 2008. International Symposium on
  • Conference_Location
    Kuala Lumpur, Malaysia
  • Print_ISBN
    978-1-4244-2327-9
  • Electronic_ISBN
    978-1-4244-2328-6
  • Type

    conf

  • DOI
    10.1109/ITSIM.2008.4631700
  • Filename
    4631700