مرکز منطقه ای اطلاع رساني علوم و فناوري - Policy iteration algorithm based on experience replay to solve H<inf>∞</inf> control problem of partially unknown nonlinear systems

DocumentCode :

188859

Title :

Policy iteration algorithm based on experience replay to solve H∞ control problem of partially unknown nonlinear systems

Author :

Yasini, S. ; Naghibi Sistani, M.B. ; Karimpour, Ali

Author_Institution :

Electr. Eng. Dept., Ferdowsi Univ. of Mashhad, Mashhad, Iran

fYear :

2014

fDate :

24-27 June 2014

Firstpage :

2103

Lastpage :

2108

Abstract :

In this paper, an online adaptive optimal control algorithm based on policy iteration (PI) is developed to solve the H_∞ control problem of partially unknown nonlinear continuous-time (CT) systems. The convergence of existing PI algorithms for solving the H_∞ control is guaranteed under the restrictive persistency of excitation (PE) condition. By using the idea of experience replay this condition is relaxed here to a simplified rank condition which is easy to verify online. This is achieved by using previously stored data concurrently with current data for updating the critic NN weights. The proposed algorithm is implemented on actor-critic-disturbance neural network (NN) structure, where all NNs are tuned at the same time to obtain the solution of the Hamilton-Jacobi-Isaacs (HJI) equation, without requiring the information on the internal system dynamics. The stability of the closed-loop system is guaranteed and the convergence to the optimal solution is obtained. Simulation results show the effectiveness of the proposed method.

Keywords :

H^∞ control; adaptive control; closed loop systems; continuous time systems; convergence of numerical methods; iterative methods; neurocontrollers; nonlinear control systems; stability; CT systems; H_∞ control problem; HJI equation; Hamilton-Jacobi-Isaacs equation; PE condition; PI algorithm; actor-critic-disturbance neural network structure; closed-loop system stability; critic NN weight updating; experience replay; internal system dynamics; online adaptive optimal control algorithm; partially unknown nonlinear continuous-time systems; policy iteration algorithm; restrictive persistency of excitation condition; simplified rank condition; Approximation methods; Artificial neural networks; Convergence; Equations; Games; Mathematical model; Optimal control;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Control Conference (ECC), 2014 European

Conference_Location :

Strasbourg

Print_ISBN :

978-3-9524269-1-3

Type :

conf

DOI :

10.1109/ECC.2014.6862229

Filename :

6862229

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=188859