DocumentCode :
3847059
Title :
Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions
Author :
Lucian Busoniu;Damien Ernst;Bart De Schutter;Robert Babuska
Author_Institution :
Delft Center for Systems and Control, Delft University of Technology, Delft, The Netherlands
Volume :
41
Issue :
1
fYear :
2011
Firstpage :
196
Lastpage :
209
Abstract :
This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-loop policy that can be represented using a given number of basis functions (BFs), where a discrete action is assigned to each BF. The type of the BFs and their number are specified in advance and determine the complexity of the representation. Considerable flexibility is achieved by optimizing the locations and shapes of the BFs, together with the action assignments. The optimization is carried out with the cross-entropy method and evaluates the policies by their empirical return from a representative set of initial states. The return for each representative state is estimated using Monte Carlo simulations. The resulting algorithm for cross-entropy policy search with adaptive BFs is extensively evaluated in problems with two to six state variables, for which it reliably obtains good policies with only a small number of BFs. In these experiments, cross-entropy policy search requires vastly fewer BFs than value-function techniques with equidistant BFs, and outperforms policy search with a competing optimization algorithm called DIRECT.
Keywords :
"Programmable control","Adaptive control","Automatic control","Marine technology","Shape","Stochastic processes","Control systems","Optimization methods","State estimation","Operations research"
Journal_Title :
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2010.2050586
Filename :
5491120
Link To Document :
بازگشت