• DocumentCode
    2753464
  • Title

    The Assessment and Application of Lineage Information in Genetic Programs for Producing Better Models

  • Author

    Boetticher, Gary D. ; Kaminsky, Kim

  • Author_Institution
    Houston Univ., TX
  • fYear
    2006
  • fDate
    16-18 Sept. 2006
  • Firstpage
    141
  • Lastpage
    146
  • Abstract
    One of the challenges in data mining, and in particular genetic programs, is to provide sufficient coverage of the search space in order to produce an acceptable model. Traditionally, genetic programs generate equations (chromosomes) and consider all chromosomes within a population for breeding purposes. Considering the enormity of the search space for complex problems, it is imperative to examine genetic programs breeding efforts in order to produce better solutions with less training. This research examines chromosome lineage within genetic programs in order to identify breeding patterns. Fitness values for chromosomes are sorted, then partitioned into five classes. Initial experiments reveal a distinct difference between upper, middle, and lower classes. Based upon initial results, a novel genetic programming process is proposed which breeds a new generation exclusively from the top 20 percent of a population. A second set of experiments statistically validate this proposed approach
  • Keywords
    data mining; genetic algorithms; search problems; data mining; genetic programming; lineage information; search space; Application software; Biological cells; Chromosome mapping; Data mining; Equations; Genetic programming; Lakes; Solids; Space exploration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration, 2006 IEEE International Conference on
  • Conference_Location
    Waikoloa Village, HI
  • Print_ISBN
    0-7803-9788-6
  • Type

    conf

  • DOI
    10.1109/IRI.2006.252403
  • Filename
    4018480