• DocumentCode
    232477
  • Title

    Building predictive models in two stages with meta-learning templates optimized by genetic programming

  • Author

    Kordik, Pavel ; Cerny, Jan

  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    The model selection stage is one of the most difficult in predictive modeling. To select a model with a highest generalization performance involves benchmarking huge number of candidate models or algorithms. Often, a final model is selected without considering potentially high quality candidates just because there are too many possibilities. Improper benchmarking methodology often leads to biased estimates of model generalization performance. Automation of the model selection stage is possible, however the computational complexity is huge especially when ensembles of models and optimization of input features should be also considered. In this paper we show, how to automate model selection process in a way that allows to search for complex hierarchies of ensemble models while maintaining computational tractability. We introduce two-stage learning, meta-learning templates optimized by evolutionary programming with anytime properties to be able to deliver and maintain data-tailored algorithms and models in a reasonable time without human interaction. Co-evolution if inputs together with optimization of templates enabled to solve algorithm selection problem efficiently for variety of datasets.
  • Keywords
    computational complexity; data mining; feature selection; genetic algorithms; learning (artificial intelligence); computational complexity; data mining algorithm; evolutionary programming; genetic programming; input feature optimization; metalearning template; model selection stage; predictive modelling; Boosting; Computational modeling; Data models; Genetic programming; Optimization; Sociology; Statistics; Bagging; Boosting; Cascade generalization; Ensembleclassifiers; Genetic programming; Hierarchical ensembles; Meta-learning; Non-stationary environments; Stacking; Transfer learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Ensemble Learning (CIEL), 2014 IEEE Symposium on
  • Conference_Location
    Orlando, FL
  • Type

    conf

  • DOI
    10.1109/CIEL.2014.7015740
  • Filename
    7015740