DocumentCode
188859
Title
Policy iteration algorithm based on experience replay to solve H∞ control problem of partially unknown nonlinear systems
Author
Yasini, S. ; Naghibi Sistani, M.B. ; Karimpour, Ali
Author_Institution
Electr. Eng. Dept., Ferdowsi Univ. of Mashhad, Mashhad, Iran
fYear
2014
fDate
24-27 June 2014
Firstpage
2103
Lastpage
2108
Abstract
In this paper, an online adaptive optimal control algorithm based on policy iteration (PI) is developed to solve the H∞ control problem of partially unknown nonlinear continuous-time (CT) systems. The convergence of existing PI algorithms for solving the H∞ control is guaranteed under the restrictive persistency of excitation (PE) condition. By using the idea of experience replay this condition is relaxed here to a simplified rank condition which is easy to verify online. This is achieved by using previously stored data concurrently with current data for updating the critic NN weights. The proposed algorithm is implemented on actor-critic-disturbance neural network (NN) structure, where all NNs are tuned at the same time to obtain the solution of the Hamilton-Jacobi-Isaacs (HJI) equation, without requiring the information on the internal system dynamics. The stability of the closed-loop system is guaranteed and the convergence to the optimal solution is obtained. Simulation results show the effectiveness of the proposed method.
Keywords
H∞ control; adaptive control; closed loop systems; continuous time systems; convergence of numerical methods; iterative methods; neurocontrollers; nonlinear control systems; stability; CT systems; H∞ control problem; HJI equation; Hamilton-Jacobi-Isaacs equation; PE condition; PI algorithm; actor-critic-disturbance neural network structure; closed-loop system stability; critic NN weight updating; experience replay; internal system dynamics; online adaptive optimal control algorithm; partially unknown nonlinear continuous-time systems; policy iteration algorithm; restrictive persistency of excitation condition; simplified rank condition; Approximation methods; Artificial neural networks; Convergence; Equations; Games; Mathematical model; Optimal control;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Conference (ECC), 2014 European
Conference_Location
Strasbourg
Print_ISBN
978-3-9524269-1-3
Type
conf
DOI
10.1109/ECC.2014.6862229
Filename
6862229
Link To Document