• DocumentCode
    2460666
  • Title

    Data Mining based on Gene Expression Programming and Clonal Selection

  • Author

    Karakasis, Vassilios K. ; Stafylopatis, Andreas

  • Author_Institution
    Nat. Tech. Univ. of Athens, Athens
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    514
  • Lastpage
    521
  • Abstract
    A hybrid evolutionary technique is proposed for data mining tasks, which combines the Clonal Selection Principle with Gene Expression Programming (GEP). The proposed algorithm introduces the notion of Data Class Antigens, which is used to represent a class of data. The produced rules are evolved by a clonal selection algorithm, which extends the recently proposed CLONALG algorithm. In the present algorithm, among other new features, a receptor editing step has been incorporated. Moreover, the rules themselves are represented as antibodies, which are coded as GEP chromosomes, in order to exploit the flexibility and the expressiveness of such encoding. The algorithm is tested on some benchmark problems of the UCI repository, and in particular on the set of MONK problems and the Pima Indians Diabetes problem. In both problems, the results in terms of prediction accuracy are very satisfactory, albeit slightly less accurate than those obtained by a standard GEP technique. In terms of convergence rate and computational efficiency, however, the technique proposed here markedly outperforms the standard GEP algorithm.
  • Keywords
    biology computing; data mining; evolutionary computation; CLONALG algorithm; MONK problems; Pima Indians Diabetes problem; UCI repository; clonal selection; data class antigens; data mining; gene expression programming; hybrid evolutionary technique; Accuracy; Benchmark testing; Biological cells; Computational efficiency; Convergence; Data mining; Diabetes; Encoding; Gene expression; Genetic programming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2006. CEC 2006. IEEE Congress on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    0-7803-9487-9
  • Type

    conf

  • DOI
    10.1109/CEC.2006.1688353
  • Filename
    1688353