Title :
Evolutionary data mining approaches for rule-based and tree-based classifiers
Author :
Weise, Thomas ; Chiong, Raymond
Author_Institution :
Nature Inspired Comput. & Applic. Lab., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
Data mining is an important process, with applications found in many business, science and industrial problems. While a wide variety of algorithms have already been proposed in the literature for classification tasks in large data sets, and the majority of them have been proven to be very effective, not all of them are flexible and easily extensible. In this paper, we introduce two new approaches for synthesizing classifiers with Evolutionary Algorithms (EAs) in supervised data mining scenarios. The first method is based on encoding rule sets with bit string genomes and the second one utilizes Genetic Programming to create decision trees with arbitrary expressions attached to the nodes. Comparisons with some sophisticated standard approaches, such as C4.5 and Random-Forest, show that the performance of the evolved classifiers can be very competitive. We further demonstrate that both proposed approaches work well across different configurations of the EAs.
Keywords :
data mining; decision trees; genetic algorithms; knowledge based systems; pattern classification; C4.5 approach; decision trees; evolutionary algorithms; evolutionary data mining approach; genetic programming; random-forest approach; rule set encoding; rule-based classifier; supervised data mining approach; tree-based classifiers; Data mining; Decision trees; Evolutionary computation; Heart; Iris; Optimization; Training; data mining; decision trees; evolutionary algorithms; rule-based classifiers;
Conference_Titel :
Cognitive Informatics (ICCI), 2010 9th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8041-8
DOI :
10.1109/COGINF.2010.5599821