DocumentCode :
1391657
Title :
Efficient Model Learning Methods for Actor–Critic Control
Author :
Grondman, I. ; Vaandrager, M. ; Busoniu, L. ; Babuska, R. ; Schuitema, E.
Author_Institution :
Delft Center for Syst. & Control, Delft Univ. of Technol., Delft, Netherlands
Volume :
42
Issue :
3
fYear :
2012
fDate :
6/1/2012 12:00:00 AM
Firstpage :
591
Lastpage :
602
Abstract :
We propose two new actor-critic algorithms for reinforcement learning. Both algorithms use local linear regression (LLR) to learn approximations of the functions involved. A crucial feature of the algorithms is that they also learn a process model, and this, in combination with LLR, provides an efficient policy update for faster learning. The first algorithm uses a novel model-based update rule for the actor parameters. The second algorithm does not use an explicit actor but learns a reference model which represents a desired behavior, from which desired control actions can be calculated using the inverse of the learned process model. The two novel methods and a standard actor-critic algorithm are applied to the pendulum swing-up problem, in which the novel methods achieve faster learning than the standard algorithm.
Keywords :
control engineering computing; function approximation; learning (artificial intelligence); manipulators; pendulums; position control; regression analysis; actor-critic algorithm; actor-critic control; function approximation; learning policy update; local linear regression; model learning method; model-based update rule; pendulum swing-up problem; reference model learning; reinforcement learning; robotic arm; Approximation algorithms; Encoding; Function approximation; Process control; Actor–critic; inverse model; local linear regression (LLR); machine learning algorithms; reinforcement learning (RL); Algorithms; Artificial Intelligence; Computer Simulation; Decision Support Techniques; Models, Theoretical; Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2011.2170565
Filename :
6096441
Link To Document :
بازگشت