• DocumentCode
    1391657
  • Title

    Efficient Model Learning Methods for Actor–Critic Control

  • Author

    Grondman, I. ; Vaandrager, M. ; Busoniu, L. ; Babuska, R. ; Schuitema, E.

  • Author_Institution
    Delft Center for Syst. & Control, Delft Univ. of Technol., Delft, Netherlands
  • Volume
    42
  • Issue
    3
  • fYear
    2012
  • fDate
    6/1/2012 12:00:00 AM
  • Firstpage
    591
  • Lastpage
    602
  • Abstract
    We propose two new actor-critic algorithms for reinforcement learning. Both algorithms use local linear regression (LLR) to learn approximations of the functions involved. A crucial feature of the algorithms is that they also learn a process model, and this, in combination with LLR, provides an efficient policy update for faster learning. The first algorithm uses a novel model-based update rule for the actor parameters. The second algorithm does not use an explicit actor but learns a reference model which represents a desired behavior, from which desired control actions can be calculated using the inverse of the learned process model. The two novel methods and a standard actor-critic algorithm are applied to the pendulum swing-up problem, in which the novel methods achieve faster learning than the standard algorithm.
  • Keywords
    control engineering computing; function approximation; learning (artificial intelligence); manipulators; pendulums; position control; regression analysis; actor-critic algorithm; actor-critic control; function approximation; learning policy update; local linear regression; model learning method; model-based update rule; pendulum swing-up problem; reference model learning; reinforcement learning; robotic arm; Approximation algorithms; Encoding; Function approximation; Process control; Actor–critic; inverse model; local linear regression (LLR); machine learning algorithms; reinforcement learning (RL); Algorithms; Artificial Intelligence; Computer Simulation; Decision Support Techniques; Models, Theoretical; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2011.2170565
  • Filename
    6096441