Title :
Improving induction decision trees with parallel genetic programming
Author :
Folino, Gianluigi ; Pizzuti, Clara ; Spezzano, Giandomenico
Author_Institution :
ISI-CNR, Rende, Italy
Abstract :
A parallel genetic programming approach to induce decision trees in large data sets is presented. A population of trees is evolved by employing the genetic operators and every individual is evaluated by using a fitness function based on the J-measure. The method is able to deal with large data sets since it uses a parallel implementation of genetic programming through the grid model. Experiments on data sets from the UCI machine learning repository show better results with respect to C5. Furthermore, performance results show a nearly linear speedup
Keywords :
data mining; decision trees; genetic algorithms; learning by example; parallel programming; J-measure; UCI machine learning repository; fitness function; genetic operators; grid model; induction decision trees; large data sets; parallel genetic programming; Classification algorithms; Classification tree analysis; Conferences; Data mining; Decision trees; Degradation; Encoding; Genetic programming; Spatial databases; Testing;
Conference_Titel :
Parallel, Distributed and Network-based Processing, 2002. Proceedings. 10th Euromicro Workshop on
Conference_Location :
Canary Islands
Print_ISBN :
0-7695-1444-8
DOI :
10.1109/EMPDP.2002.994264