• Title of article

    A constrained-syntax genetic programming system for discovering classification rules: application to medical data sets

  • Author/Authors

    E.E. and Bojarczuk، نويسنده , , Celia C. and Lopes، نويسنده , , Heitor S. and Freitas، نويسنده , , Alex A. and Michalkiewicz، نويسنده , , Edson L.، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2004
  • Pages
    22
  • From page
    27
  • To page
    48
  • Abstract
    This paper proposes a new constrained-syntax genetic programming (GP) algorithm for discovering classification rules in medical data sets. The proposed GP contains several syntactic constraints to be enforced by the system using a disjunctive normal form representation, so that individuals represent valid rule sets that are easy to interpret. The GP is compared with C4.5, a well-known decision-tree-building algorithm, and with another GP that uses Boolean inputs (BGP), in five medical data sets: chest pain, Ljubljana breast cancer, dermatology, Wisconsin breast cancer, and pediatric adrenocortical tumor. For this last data set a new preprocessing step was devised for survival prediction. Computational experiments show that, overall, the GP algorithm obtained good results with respect to predictive accuracy and rule comprehensibility, by comparison with C4.5 and BGP.
  • Keywords
    Constrained-syntax genetic programming , Classification rules , DATA MINING
  • Journal title
    Artificial Intelligence In Medicine
  • Serial Year
    2004
  • Journal title
    Artificial Intelligence In Medicine
  • Record number

    1836073