• DocumentCode
    57002
  • Title

    Reinforcement Learning Design-Based Adaptive Tracking Control With Less Learning Parameters for Nonlinear Discrete-Time MIMO Systems

  • Author

    Yan-Jun Liu ; Li Tang ; Shaocheng Tong ; Chen, C.L.P. ; Dong-Juan Li

  • Author_Institution
    Coll. of Sci., Liaoning Univ. of Technol., Jinzhou, China
  • Volume
    26
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan. 2015
  • Firstpage
    165
  • Lastpage
    176
  • Abstract
    Based on the neural network (NN) approximator, an online reinforcement learning algorithm is proposed for a class of affine multiple input and multiple output (MIMO) nonlinear discrete-time systems with unknown functions and disturbances. In the design procedure, two networks are provided where one is an action network to generate an optimal control signal and the other is a critic network to approximate the cost function. An optimal control signal and adaptation laws can be generated based on two NNs. In the previous approaches, the weights of critic and action networks are updated based on the gradient descent rule and the estimations of optimal weight vectors are directly adjusted in the design. Consequently, compared with the existing results, the main contributions of this paper are: (1) only two parameters are needed to be adjusted, and thus the number of the adaptation laws is smaller than the previous results and (2) the updating parameters do not depend on the number of the subsystems for MIMO systems and the tuning rules are replaced by adjusting the norms on optimal weight vectors in both action and critic networks. It is proven that the tracking errors, the adaptation laws, and the control inputs are uniformly bounded using Lyapunov analysis method. The simulation examples are employed to illustrate the effectiveness of the proposed algorithm.
  • Keywords
    Lyapunov methods; MIMO systems; adaptive control; approximation theory; control system synthesis; discrete time systems; gradient methods; learning (artificial intelligence); neurocontrollers; nonlinear control systems; optimal control; Lyapunov analysis method; NN approximator; action network weight update; adaptation laws; affine multiple input-and-multiple output nonlinear discrete-time systems; cost function approximation; critic network weight update; gradient descent rule; learning parameters; neural network approximator; nonlinear discrete-time MIMO systems; online reinforcement learning algorithm; optimal control signal; optimal weight vector estimation; optimal weight vectors; parameter update; reinforcement learning design-based adaptive tracking control; tracking errors; tuning rules; uniformly bounded control inputs; unknown disturbances; unknown functions; Algorithm design and analysis; Approximation methods; Artificial neural networks; Estimation; MIMO; Optimal control; Vectors; Adaptive control; discrete-time systems; online approximators; reinforcement learning (RL); uncertain nonlinear systems; uncertain nonlinear systems.;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2014.2360724
  • Filename
    6966778