• DocumentCode
    2396452
  • Title

    Improving induction decision trees with parallel genetic programming

  • Author

    Folino, Gianluigi ; Pizzuti, Clara ; Spezzano, Giandomenico

  • Author_Institution
    ISI-CNR, Rende, Italy
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    181
  • Lastpage
    187
  • Abstract
    A parallel genetic programming approach to induce decision trees in large data sets is presented. A population of trees is evolved by employing the genetic operators and every individual is evaluated by using a fitness function based on the J-measure. The method is able to deal with large data sets since it uses a parallel implementation of genetic programming through the grid model. Experiments on data sets from the UCI machine learning repository show better results with respect to C5. Furthermore, performance results show a nearly linear speedup
  • Keywords
    data mining; decision trees; genetic algorithms; learning by example; parallel programming; J-measure; UCI machine learning repository; fitness function; genetic operators; grid model; induction decision trees; large data sets; parallel genetic programming; Classification algorithms; Classification tree analysis; Conferences; Data mining; Decision trees; Degradation; Encoding; Genetic programming; Spatial databases; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-based Processing, 2002. Proceedings. 10th Euromicro Workshop on
  • Conference_Location
    Canary Islands
  • Print_ISBN
    0-7695-1444-8
  • Type

    conf

  • DOI
    10.1109/EMPDP.2002.994264
  • Filename
    994264