Title :
Reinforcement Learning Design-Based Adaptive Tracking Control With Less Learning Parameters for Nonlinear Discrete-Time MIMO Systems
Author :
Yan-Jun Liu ; Li Tang ; Shaocheng Tong ; Chen, C.L.P. ; Dong-Juan Li
Author_Institution :
Coll. of Sci., Liaoning Univ. of Technol., Jinzhou, China
Abstract :
Based on the neural network (NN) approximator, an online reinforcement learning algorithm is proposed for a class of affine multiple input and multiple output (MIMO) nonlinear discrete-time systems with unknown functions and disturbances. In the design procedure, two networks are provided where one is an action network to generate an optimal control signal and the other is a critic network to approximate the cost function. An optimal control signal and adaptation laws can be generated based on two NNs. In the previous approaches, the weights of critic and action networks are updated based on the gradient descent rule and the estimations of optimal weight vectors are directly adjusted in the design. Consequently, compared with the existing results, the main contributions of this paper are: (1) only two parameters are needed to be adjusted, and thus the number of the adaptation laws is smaller than the previous results and (2) the updating parameters do not depend on the number of the subsystems for MIMO systems and the tuning rules are replaced by adjusting the norms on optimal weight vectors in both action and critic networks. It is proven that the tracking errors, the adaptation laws, and the control inputs are uniformly bounded using Lyapunov analysis method. The simulation examples are employed to illustrate the effectiveness of the proposed algorithm.
Keywords :
Lyapunov methods; MIMO systems; adaptive control; approximation theory; control system synthesis; discrete time systems; gradient methods; learning (artificial intelligence); neurocontrollers; nonlinear control systems; optimal control; Lyapunov analysis method; NN approximator; action network weight update; adaptation laws; affine multiple input-and-multiple output nonlinear discrete-time systems; cost function approximation; critic network weight update; gradient descent rule; learning parameters; neural network approximator; nonlinear discrete-time MIMO systems; online reinforcement learning algorithm; optimal control signal; optimal weight vector estimation; optimal weight vectors; parameter update; reinforcement learning design-based adaptive tracking control; tracking errors; tuning rules; uniformly bounded control inputs; unknown disturbances; unknown functions; Algorithm design and analysis; Approximation methods; Artificial neural networks; Estimation; MIMO; Optimal control; Vectors; Adaptive control; discrete-time systems; online approximators; reinforcement learning (RL); uncertain nonlinear systems; uncertain nonlinear systems.;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2014.2360724