• DocumentCode
    3572793
  • Title

    Optimal learning control for discrete-time nonlinear systems using generalized policy iteration based adaptive dynamic programming

  • Author

    Qinglai Wei ; Derong Liu

  • Author_Institution
    State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
  • fYear
    2014
  • Firstpage
    1781
  • Lastpage
    1786
  • Abstract
    In this paper, a novel generalized policy iteration algorithm is investigated to solve infinite horizon optimal control problems for discrete-time nonlinear systems. Two iteration indices are introduced in the generalized policy iteration algorithm, which iterate for policy improvement and policy evaluation, respectively. For the first time the properties of monotonicity, convergence and admissibility for the generalized policy iteration algorithm are analyzed to guarantee that the iterative performance index function converges to the optimum and the iterative control law stabilizes the control system. Finally, numerical results are presented to illustrate the performance of the developed method.
  • Keywords
    discrete time systems; dynamic programming; iterative methods; nonlinear control systems; optimal control; adaptive dynamic programming; discrete-time nonlinear systems; generalized policy iteration algorithm; infinite horizon optimal control problems; iteration indices; iterative control law; iterative performance index function; optimal learning control; policy evaluation; policy improvement; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; generalized policy iteration; neural networks; neuro-dynamic programming; nonlinear systems; optimal control; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control and Automation (WCICA), 2014 11th World Congress on
  • Type

    conf

  • DOI
    10.1109/WCICA.2014.7052990
  • Filename
    7052990