DocumentCode
3572793
Title
Optimal learning control for discrete-time nonlinear systems using generalized policy iteration based adaptive dynamic programming
Author
Qinglai Wei ; Derong Liu
Author_Institution
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
fYear
2014
Firstpage
1781
Lastpage
1786
Abstract
In this paper, a novel generalized policy iteration algorithm is investigated to solve infinite horizon optimal control problems for discrete-time nonlinear systems. Two iteration indices are introduced in the generalized policy iteration algorithm, which iterate for policy improvement and policy evaluation, respectively. For the first time the properties of monotonicity, convergence and admissibility for the generalized policy iteration algorithm are analyzed to guarantee that the iterative performance index function converges to the optimum and the iterative control law stabilizes the control system. Finally, numerical results are presented to illustrate the performance of the developed method.
Keywords
discrete time systems; dynamic programming; iterative methods; nonlinear control systems; optimal control; adaptive dynamic programming; discrete-time nonlinear systems; generalized policy iteration algorithm; infinite horizon optimal control problems; iteration indices; iterative control law; iterative performance index function; optimal learning control; policy evaluation; policy improvement; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; generalized policy iteration; neural networks; neuro-dynamic programming; nonlinear systems; optimal control; reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Control and Automation (WCICA), 2014 11th World Congress on
Type
conf
DOI
10.1109/WCICA.2014.7052990
Filename
7052990
Link To Document