مرکز منطقه ای اطلاع رساني علوم و فناوري - Finite horizon stochastic optimal control of nonlinear two-player zero-sum games under communication constraint

DocumentCode :

1797779

Title :

Finite horizon stochastic optimal control of nonlinear two-player zero-sum games under communication constraint

Author :

Hao Xu ; Jagannathan, Sarangapani

Author_Institution :

Dept. of Electr. & Comput. Eng., Missouri Univ. of Sci. & Technol., Rolla, MO, USA

fYear :

2014

fDate :

6-11 July 2014

Firstpage :

239

Lastpage :

244

Abstract :

In this paper, the finite horizon stochastic optimal control of nonlinear two-player zero-sum games, referred to as Nonlinear Networked Control Systems (NNCS) two-player zero-sum game, between control and disturbance input players in the presence of unknown system dynamics and a communication network with delays and packet losses is addressed by using neuro dynamic programming (NDP). The overall objective being to find the optimal control input while maximizing the disturbance attenuation. First, a novel online neural network (NN) identifier is introduced to estimate the unknown control and disturbance coefficient matrices which are needed in the generation of optimal control input. Then, the critic and two actor NNs have been introduced to learn the time-varying solution to the Hamilton-Jacobi-Isaacs (HJI) equation and determine the stochastic optimal control and disturbance policies in a forward-in-time manner. Eventually, with the proposed novel NN weight update laws, Lyapunov theory is utilized to demonstrate that all closed-loop signals and NN weights are uniformly ultimately bounded (ÜUB) during the finite horizon with ultimate bounds being a function of initial conditions and final time. Further, the approximated control input and disturbance signals tend close to the saddle-point equilibrium within finite-time. Simulation results are included.

Keywords :

dynamic programming; game theory; networked control systems; nonlinear control systems; optimal control; stochastic systems; time-varying systems; Hamilton-Jacobi-Isaacs equation; Lyapunov theory; NDP; communication constraint; disturbance coefficient matrices; finite horizon; finite horizon stochastic optimal control; neuro dynamic programming; nonlinear networked control system; nonlinear two-player zero-sum games; online neural network identifier; saddle-point equilibrium; stochastic optimal control; time-varying solution; uniformly ultimately bounded; unknown control matrix; Artificial neural networks; Delays; Equations; Game theory; Games; Optimal control; Packet loss;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), 2014 International Joint Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-4799-6627-1

Type :

conf

DOI :

10.1109/IJCNN.2014.6889617

Filename :

6889617

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1797779