${H}_{ {\\infty }}$ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning

Author

Modares, Hamidreza ; Lewis, Frank L. ; Zhong-Ping Jiang

Author_Institution

Univ. of Texas at Arlington Res. Inst., Fort Worth, TX, USA

Volume

26

Issue

10

fYear

2015

Firstpage

2550

Lastpage

2562

Abstract

This paper deals with the design of an H_∞ tracking controller for nonlinear continuous-time systems with completely unknown dynamics. A general bounded L₂-gain tracking problem with a discounted performance function is introduced for the H_∞ tracking. A tracking Hamilton-Jacobi-Isaac (HJI) equation is then developed that gives a Nash equilibrium solution to the associated min-max optimization problem. A rigorous analysis of bounded L₂-gain and stability of the control solution obtained by solving the tracking HJI equation is provided. An upper-bound is found for the discount factor to assure local asymptotic stability of the tracking error dynamics. An off-policy reinforcement learning algorithm is used to learn the solution to the tracking HJI equation online without requiring any knowledge of the system dynamics. Convergence of the proposed algorithm to the solution to the tracking HJI equation is shown. Simulation examples are provided to verify the effectiveness of the proposed method.

Keywords

H^∞ control; asymptotic stability; continuous time systems; control system synthesis; game theory; learning (artificial intelligence); nonlinear control systems; tracking; H_∞ tracking controller design; Nash equilibrium solution; completely unknown continuous-time systems; completely unknown dynamics; discounted performance function; general bounded L₂-gain tracking problem; local asymptotic stability; nonlinear continuous-time systems; off-policy reinforcement learning algorithm; tracking Hamilton-Jacobi-Isaac equation; tracking error dynamics; Asymptotic stability; Attenuation; Feedforward neural networks; Heuristic algorithms; Mathematical model; Optimal control; Trajectory; $H_{infty }$ tracking controller; Bounded $L_{2}$ -gain; Bounded L₂-gain; H∞ tracking controller; reinforcement learning (RL); tracking Hamilton-Jacobi-Isaac (HJI) equation.; tracking Hamilton???Jacobi???Isaac (HJI) equation;

fLanguage

English

Journal_Title

Neural Networks and Learning Systems, IEEE Transactions on

Publisher

ieee

ISSN

2162-237X

Type

jour

DOI

10.1109/TNNLS.2015.2441749

Filename

7132753