• DocumentCode
    1527900
  • Title

    Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update

  • Author

    Dierks, T. ; Jagannathan, S.

  • Author_Institution
    DRS Sustainment Syst., Inc., St. Louis, MO, USA
  • Volume
    23
  • Issue
    7
  • fYear
    2012
  • fDate
    7/1/2012 12:00:00 AM
  • Firstpage
    1118
  • Lastpage
    1129
  • Abstract
    In this paper, the Hamilton-Jacobi-Bellman equation is solved forward-in-time for the optimal control of a class of general affine nonlinear discrete-time systems without using value and policy iterations. The proposed approach, referred to as adaptive dynamic programming, uses two neural networks (NNs), to solve the infinite horizon optimal regulation control of affine nonlinear discrete-time systems in the presence of unknown internal dynamics and a known control coefficient matrix. One NN approximates the cost function and is referred to as the critic NN, while the second NN generates the control input and is referred to as the action NN. The cost function and policy are updated once at the sampling instant and thus the proposed approach can be referred to as time-based ADP. Novel update laws for tuning the unknown weights of the NNs online are derived. Lyapunov techniques are used to show that all signals are uniformly ultimately bounded and that the approximated control signal approaches the optimal control input with small bounded error over time. In the absence of disturbances, an optimal control is demonstrated. Simulation results are included to show the effectiveness of the approach. The end result is the systematic design of an optimal controller with guaranteed convergence that is suitable for hardware implementation.
  • Keywords
    discrete time systems; dynamic programming; neural nets; nonlinear systems; optical waveguide filters; optimal control; Hamilton-Jacobi-Bellman equation; Lyapunov techniques; NN; adaptive dynamic programming; approximated control signal; control coefficient matrix; forward-in-time; general affine nonlinear discrete-time systems; infinite horizon optimal regulation control; neural networks; online optimal control; optimal controller; systematic design; time-based ADP; time-based policy update; unknown internal dynamics; Approximation methods; Artificial neural networks; Cost function; Estimation error; Nonlinear dynamical systems; Optimal control; Hamilton–Jacobi–Bellman; online approximators; online nonlinear optimal control; time-based policy update;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2012.2196708
  • Filename
    6208889