• DocumentCode
    84926
  • Title

    Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems

  • Author

    Qinglai Wei ; Derong Liu ; Xiong Yang

  • Author_Institution
    State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
  • Volume
    26
  • Issue
    4
  • fYear
    2015
  • fDate
    Apr-15
  • Firstpage
    866
  • Lastpage
    879
  • Abstract
    In this paper, a novel iterative adaptive dynamic programming (ADP)-based infinite horizon self-learning optimal control algorithm, called generalized policy iteration algorithm, is developed for nonaffine discrete-time (DT) nonlinear systems. Generalized policy iteration algorithm is a general idea of interacting policy and value iteration algorithms of ADP. The developed generalized policy iteration algorithm permits an arbitrary positive semidefinite function to initialize the algorithm, where two iteration indices are used for policy improvement and policy evaluation, respectively. It is the first time that the convergence, admissibility, and optimality properties of the generalized policy iteration algorithm for DT nonlinear systems are analyzed. Neural networks are used to implement the developed algorithm. Finally, numerical examples are presented to illustrate the performance of the developed algorithm.
  • Keywords
    discrete time systems; dynamic programming; iterative methods; nonlinear control systems; optimal control; ADP; DT nonlinear systems; arbitrary positive semidefinite function; generalized policy iteration algorithm; interacting policy; nonaffine discrete-time nonlinear systems; novel iterative adaptive dynamic programming based infinite horizon self-learning optimal control algorithm; policy evaluation; policy improvement; value iteration algorithms; Algorithm design and analysis; Convergence; Heuristic algorithms; Nickel; Nonlinear systems; Optimal control; Performance analysis; Adaptive critic designs; adaptive dynamic programming (ADP); approximate dynamic programming; generalized policy iteration; neural networks (NNs); neurodynamic programming; nonlinear systems; optimal control; reinforcement learning; reinforcement learning.;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2015.2401334
  • Filename
    7052401