Title :
Neural Network-Based Finite Horizon Stochastic Optimal Control Design for Nonlinear Networked Control Systems
Author :
Hao Xu ; Jagannathan, Sarangapani
Author_Institution :
Dept. of Eng., Univ. of Tennessee at Martin, Martin, TN, USA
Abstract :
The stochastic optimal control of nonlinear networked control systems (NNCSs) using neuro-dynamic programming (NDP) over a finite time horizon is a challenging problem due to terminal constraints, system uncertainties, and unknown network imperfections, such as network-induced delays and packet losses. Since the traditional iteration or time-based infinite horizon NDP schemes are unsuitable for NNCS with terminal constraints, a novel time-based NDP scheme is developed to solve finite horizon optimal control of NNCS by mitigating the above-mentioned challenges. First, an online neural network (NN) identifier is introduced to approximate the control coefficient matrix that is subsequently utilized in conjunction with the critic and actor NNs to determine a time-based stochastic optimal control input over finite horizon in a forward-in-time and online manner. Eventually, Lyapunov theory is used to show that all closed-loop signals and NN weights are uniformly ultimately bounded with ultimate bounds being a function of initial conditions and final time. Moreover, the approximated control input converges close to optimal value within finite time. The simulation results are included to show the effectiveness of the proposed scheme.
Keywords :
Lyapunov methods; closed loop systems; control system synthesis; delays; dynamic programming; matrix algebra; networked control systems; neurocontrollers; nonlinear control systems; optimal control; stochastic systems; Lyapunov theory; NN weights; NNCS; closed-loop signals; control coefficient matrix; finite time horizon; network-induced delays; neural network-based finite horizon stochastic optimal control design; neuro-dynamic programming; nonlinear networked control systems; online neural network identifier; packet losses; stochastic optimal control; system uncertainties; terminal constraints; time-based NDP scheme; unknown network imperfections; Artificial neural networks; Delays; Estimation error; Optimal control; Packet loss; Stability analysis; Neuro-dynamic programming (NDP); nonlinear networked control system (NNCS); stochastic optimal control; stochastic optimal control.;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2014.2315622