• DocumentCode
    2498136
  • Title

    Online adaptive learning of optimal control solutions using integral reinforcement learning

  • Author

    Vamvoudakis, Kyriakos G. ; Vrabie, Draguna ; Lewis, Frank L.

  • Author_Institution
    Autom. & Robot. Res. Inst., Univ. of Texas at Arlington, Fort Worth, TX, USA
  • fYear
    2011
  • fDate
    11-15 April 2011
  • Firstpage
    250
  • Lastpage
    257
  • Abstract
    In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learning the continuous-time optimal control solution for nonlinear systems with infinite horizon costs and partial knowledge of the system dynamics. This algorithm is a data based approach to the solution of the Hamilton-Jacobi-Bellman equation and it does not require explicit knowledge on the system´s drift dynamics. The adaptive algorithm is based on policy iteration, and it is implemented on an actor/critic structure. Both actor and critic neural networks are adapted simultaneously a persistence of excitation condition is required to guarantee convergence of the critic to the actual optimal value function. Novel tuning algorithms are given for both critic and actor networks, with extra terms in the actor tuning law being required to guarantee closed-loop dynamical stability. The convergence to the optimal controller is proven, and stability of the system is also guaranteed. Simulation examples support the theoretical result.
  • Keywords
    Jacobian matrices; adaptive control; closed loop systems; continuous time systems; iterative methods; learning (artificial intelligence); learning systems; neurocontrollers; nonlinear control systems; optimal control; stability; Hamilton-Jacobi-Bellman equation; actor neural networks; closed-loop dynamical stability; continuous-time optimal control solution; critic neural networks; infinite horizon costs; integral reinforcement knowledge; integral reinforcement learning; nonlinear systems; online adaptive learning; policy iteration; system dynamics; tuning algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4244-9887-1
  • Type

    conf

  • DOI
    10.1109/ADPRL.2011.5967359
  • Filename
    5967359