DocumentCode
2396452
Title
Improving induction decision trees with parallel genetic programming
Author
Folino, Gianluigi ; Pizzuti, Clara ; Spezzano, Giandomenico
Author_Institution
ISI-CNR, Rende, Italy
fYear
2002
fDate
2002
Firstpage
181
Lastpage
187
Abstract
A parallel genetic programming approach to induce decision trees in large data sets is presented. A population of trees is evolved by employing the genetic operators and every individual is evaluated by using a fitness function based on the J-measure. The method is able to deal with large data sets since it uses a parallel implementation of genetic programming through the grid model. Experiments on data sets from the UCI machine learning repository show better results with respect to C5. Furthermore, performance results show a nearly linear speedup
Keywords
data mining; decision trees; genetic algorithms; learning by example; parallel programming; J-measure; UCI machine learning repository; fitness function; genetic operators; grid model; induction decision trees; large data sets; parallel genetic programming; Classification algorithms; Classification tree analysis; Conferences; Data mining; Decision trees; Degradation; Encoding; Genetic programming; Spatial databases; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel, Distributed and Network-based Processing, 2002. Proceedings. 10th Euromicro Workshop on
Conference_Location
Canary Islands
Print_ISBN
0-7695-1444-8
Type
conf
DOI
10.1109/EMPDP.2002.994264
Filename
994264
Link To Document