Title :
Classification by essential emerging patterns in two phases
Author :
Fan, Ming ; Zhi, Wei-Mei ; Fan, Hong-Jian ; Sun, Yig-Ui
Author_Institution :
Dept. of Comput. Sci., Zhengzhou Univ., Henan, China
Abstract :
PNrule is a new two-phase framework for learning classifier models in data mining. The first phase detects the presence of the target class, while the second detects the absence of the target class. Emerging patterns (EPs) are itemsets whose supports change significantly from one data class to another, while essential emerging patterns (eEPs) are minimal ones among the EPs. It has been shown that the eEPs are useful and sufficient for building accurate classifiers. This work proposes a novel classification method, called CEEPTP, to combine the idea of two-phase induction and classification by EPs. The experiment study carried on 15 benchmark datasets from the UCI machine learning repository shows that CEEPTP performs comparably with other classification methods such as CBA, CMAR, C5.0, NB, and CAEP in terms of overall predictive accuracy.
Keywords :
data mining; learning (artificial intelligence); pattern classification; benchmark datasets; data mining; emerging patterns; learning classifier models; machine learning; pattern classification; target class detection; two phase classification; two phase induction; Accuracy; Computer science; Data mining; Electronic mail; Itemsets; Machine learning; Niobium; Phase detection; Statistics; Sun;
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
DOI :
10.1109/ICMLC.2004.1381993