Title :
Zero-Sum Two-Player Game Theoretic Formulation of Affine Nonlinear Discrete-Time Systems Using Neural Networks
Author :
Mehraeen, Shahab ; Dierks, Travis ; Jagannathan, Sarangapani ; Crow, Mariesa L.
Author_Institution :
Dept. of Electr. & Comput. Eng., Louisiana State Univ., Baton Rouge, LA, USA
Abstract :
In this paper, the nearly optimal solution for discrete-time (DT) affine nonlinear control systems in the presence of partially unknown internal system dynamics and disturbances is considered. The approach is based on successive approximate solution of the Hamilton-Jacobi-Isaacs (HJI) equation, which appears in optimal control. Successive approximation approach for updating control and disturbance inputs for DT nonlinear affine systems are proposed. Moreover, sufficient conditions for the convergence of the approximate HJI solution to the saddle point are derived, and an iterative approach to approximate the HJI equation using a neural network (NN) is presented. Then, the requirement of full knowledge of the internal dynamics of the nonlinear DT system is relaxed by using a second NN online approximator. The result is a closed-loop optimal NN controller via offline learning. A numerical example is provided illustrating the effectiveness of the approach.
Keywords :
approximation theory; closed loop systems; convergence of numerical methods; discrete time systems; game theory; iterative methods; learning (artificial intelligence); neurocontrollers; nonlinear control systems; nonlinear equations; optimal control; DT nonlinear affine system; HJI equation; Hamilton-Jacobi-Isaacs equation; closed loop optimal NN controller; convergence; discrete time system; iterative approach; neural network; offline learning; successive approximation approach; sufficient condition; unknown internal system dynamics; zero sum two player game theory; Approximation methods; Equations; Game theory; Games; Nonlinear systems; Optimal control; Taylor series; Hamilton–Jacobi–Isaacs (HJI); neural networks (NNs); nonlinear discrete-time (DT) systems; optimal control;
Journal_Title :
Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMCB.2012.2227253