Title :
Simple Hybrid and Incremental Postpruning Techniques for Rule Induction
Author_Institution :
Dept. of Software Eng., Univ. of Eng. & Technol., Taxila, Pakistan
Abstract :
Pruning achieves the dual goal of reducing the complexity of the final hypothesis for improved comprehensibility, and improving its predictive accuracy by minimizing the overfitting due to noisy data. This paper presents a new hybrid pruning technique for rule induction, as well as an incremental postpruning technique based on a misclassification tolerance. Although both have been designed for RULES-7, the latter is also applicable to any rule induction algorithm in general. A thorough empirical evaluation reveals that the proposed techniques enable RULES-7 to outperform other state-of-the-art classification techniques. The improved classifier is also more accurate and up to two orders of magnitude faster than before.
Keywords :
data mining; hybrid postpruning techniques; incremental postpruning techniques; noisy data; rule induction; rule induction algorithm; Accuracy; Classification algorithms; Data mining; Machine learning; Noise measurement; Runtime; Supervised learning; Training data; Overfitting; classification; data mining; inductive learning; knowledge discovery; machine learning; noise handling; pruning; rule induction; supervised learning;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2011.237