• DocumentCode
    582035
  • Title

    Neural-network-based optimal control for discrete-time nonlinear systems using general value iteration

  • Author

    Hongliang, Li ; Derong, Liu ; Ding, Andwang

  • Author_Institution
    State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
  • fYear
    2012
  • fDate
    25-27 July 2012
  • Firstpage
    2932
  • Lastpage
    2937
  • Abstract
    In this paper, we propose a novel adaptive dynamic programming (ADP) scheme based on general value iteration to obtain near optimal control for discrete-time nonlinear systems with continuous state and control space. First, the selection of initial value function is different from the traditional value iteration, and a new method is introduced to demonstrate the convergence property and convergence speed of the value function. Then, the control law obtained at each iteration can stabilize the system under some conditions. At last, three neural networks with Levenberg-Marquardt training algorithm are used to approximate the unknown nonlinear system, the value function and the optimal control law. One simulation example is presented to demonstrate the effectiveness of the present scheme.
  • Keywords
    adaptive control; approximation theory; convergence of numerical methods; discrete time systems; dynamic programming; initial value problems; iterative methods; neurocontrollers; nonlinear control systems; optimal control; stability; ADP scheme; Levenberg-Marquardt training algorithm; adaptive dynamic programming scheme; continuous control space; continuous state space; convergence property; convergence speed; discrete-time nonlinear systems; general value iteration; initial value function selection; near optimal control; neural-network-based optimal control; Approximation algorithms; Approximation methods; Artificial neural networks; Convergence; Equations; Nonlinear systems; Optimal control; Adaptive dynamic programming; approximate dynamic programming; neural networks; optimal control; reinforcement learning; value iteration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Conference (CCC), 2012 31st Chinese
  • Conference_Location
    Hefei
  • ISSN
    1934-1768
  • Print_ISBN
    978-1-4673-2581-3
  • Type

    conf

  • Filename
    6390424