DocumentCode :
1797779
Title :
Finite horizon stochastic optimal control of nonlinear two-player zero-sum games under communication constraint
Author :
Hao Xu ; Jagannathan, Sarangapani
Author_Institution :
Dept. of Electr. & Comput. Eng., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
239
Lastpage :
244
Abstract :
In this paper, the finite horizon stochastic optimal control of nonlinear two-player zero-sum games, referred to as Nonlinear Networked Control Systems (NNCS) two-player zero-sum game, between control and disturbance input players in the presence of unknown system dynamics and a communication network with delays and packet losses is addressed by using neuro dynamic programming (NDP). The overall objective being to find the optimal control input while maximizing the disturbance attenuation. First, a novel online neural network (NN) identifier is introduced to estimate the unknown control and disturbance coefficient matrices which are needed in the generation of optimal control input. Then, the critic and two actor NNs have been introduced to learn the time-varying solution to the Hamilton-Jacobi-Isaacs (HJI) equation and determine the stochastic optimal control and disturbance policies in a forward-in-time manner. Eventually, with the proposed novel NN weight update laws, Lyapunov theory is utilized to demonstrate that all closed-loop signals and NN weights are uniformly ultimately bounded (ÜUB) during the finite horizon with ultimate bounds being a function of initial conditions and final time. Further, the approximated control input and disturbance signals tend close to the saddle-point equilibrium within finite-time. Simulation results are included.
Keywords :
dynamic programming; game theory; networked control systems; nonlinear control systems; optimal control; stochastic systems; time-varying systems; Hamilton-Jacobi-Isaacs equation; Lyapunov theory; NDP; communication constraint; disturbance coefficient matrices; finite horizon; finite horizon stochastic optimal control; neuro dynamic programming; nonlinear networked control system; nonlinear two-player zero-sum games; online neural network identifier; saddle-point equilibrium; stochastic optimal control; time-varying solution; uniformly ultimately bounded; unknown control matrix; Artificial neural networks; Delays; Equations; Game theory; Games; Optimal control; Packet loss;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889617
Filename :
6889617
Link To Document :
بازگشت