• DocumentCode
    1765173
  • Title

    Evolutionary Design of Decision-Tree Algorithms Tailored to Microarray Gene Expression Data Sets

  • Author

    Barros, Rodrigo C. ; Basgalupp, Marcio P. ; Freitas, Alex A. ; de Carvalho, Andre C. P. L. F.

  • Author_Institution
    Fac. de Inf., Pontificia Univ. Catolica do Rio Grande do Sul, Porto Alegre, Brazil
  • Volume
    18
  • Issue
    6
  • fYear
    2014
  • fDate
    Dec. 2014
  • Firstpage
    873
  • Lastpage
    892
  • Abstract
    Decision-tree induction algorithms are widely used in machine learning applications in which the goal is to extract knowledge from data and present it in a graphically intuitive way. The most successful strategy for inducing decision trees is the greedy top-down recursive approach, which has been continuously improved by researchers over the past 40 years. In this paper, we propose a paradigm shift in the research of decision trees: instead of proposing a new manually designed method for inducing decision trees, we propose automatically designing decision-tree induction algorithms tailored to a specific type of classification data set (or application domain). Following recent breakthroughs in the automatic design of machine learning algorithms, we propose a hyper-heuristic evolutionary algorithm called hyper-heuristic evolutionary algorithm for designing decision-tree algorithms (HEAD-DT) that evolves design components of top-down decision-tree induction algorithms. By the end of the evolution, we expect HEAD-DT to generate a new and possibly better decision-tree algorithm for a given application domain. We perform extensive experiments in 35 real-world microarray gene expression data sets to assess the performance of HEAD-DT, and compare it with very well known decision-tree algorithms such as C4.5, CART, and REPTree. Results show that HEAD-DT is capable of generating algorithms that significantly outperform the baseline manually designed decision-tree algorithms regarding predictive accuracy and F-measure.
  • Keywords
    biology computing; decision trees; evolutionary computation; genetics; learning (artificial intelligence); pattern classification; F-measure; HEAD-DT algorithm; application domain; classification data set; decision-tree algorithms; decision-tree induction algorithms; evolutionary design; greedy top-down recursive approach; hyperheuristic evolutionary algorithm; machine learning applications; microarray gene expression data set; Accuracy; Algorithm design and analysis; Decision trees; Evolutionary computation; Machine learning algorithms; Prediction algorithms; Training; Automatic algorithm design; automatic algorithm design; classification; decision trees; evolutionary algorithms; hyper-heuristics; hyperheuristics; machine learning;
  • fLanguage
    English
  • Journal_Title
    Evolutionary Computation, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1089-778X
  • Type

    jour

  • DOI
    10.1109/TEVC.2013.2291813
  • Filename
    6670778