• DocumentCode
    1181051
  • Title

    Gradient descent methods in learning classifier systems: improving XCS performance in multistep problems

  • Author

    Butz, Martin V. ; Goldberg, David E. ; Lanzi, Pier Luca

  • Author_Institution
    Dept. of Gen. Eng., Univ. of Illinois, Urbana, IL, USA
  • Volume
    9
  • Issue
    5
  • fYear
    2005
  • Firstpage
    452
  • Lastpage
    473
  • Abstract
    The accuracy-based XCS classifier system has been shown to solve typical data mining problems in a machine-learning competitive way. However, successful applications in multistep problems, modeled by a Markov decision process, were restricted to very small problems. Until now, the temporal difference learning technique in XCS was based on deterministic updates. However, since a prediction is actually generated by a set of rules in XCS and Learning Classifier Systems in general, gradient-based update methods are applicable. The extension of XCS to gradient-based update methods results in a classifier system that is more robust and more parameter independent, solving large and difficult maze problems reliably. Additionally, the extension to gradient methods highlights the relation of XCS to other function approximation methods in reinforcement learning.
  • Keywords
    Markov processes; data mining; evolutionary computation; function approximation; gradient methods; learning (artificial intelligence); Markov decision process; XCS performance; data mining; evolutionary computation; function approximation method; gradient descent method; learning classifier system; machine learning; multistep problem; reinforcement learning; Data mining; Evolutionary computation; Function approximation; Genetic algorithms; Genetic engineering; Government; Laboratories; Learning systems; Robustness; Zero current switching; Function approximation; Q-learning; XCS; gradient descent; learning classifier systems (LCSs); multistop problems; reinforcement learning;
  • fLanguage
    English
  • Journal_Title
    Evolutionary Computation, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1089-778X
  • Type

    jour

  • DOI
    10.1109/TEVC.2005.850265
  • Filename
    1514471