Title :
Policy iteration algorithm based on experience replay to solve H∞ control problem of partially unknown nonlinear systems
Author :
Yasini, S. ; Naghibi Sistani, M.B. ; Karimpour, Ali
Author_Institution :
Electr. Eng. Dept., Ferdowsi Univ. of Mashhad, Mashhad, Iran
Abstract :
In this paper, an online adaptive optimal control algorithm based on policy iteration (PI) is developed to solve the H∞ control problem of partially unknown nonlinear continuous-time (CT) systems. The convergence of existing PI algorithms for solving the H∞ control is guaranteed under the restrictive persistency of excitation (PE) condition. By using the idea of experience replay this condition is relaxed here to a simplified rank condition which is easy to verify online. This is achieved by using previously stored data concurrently with current data for updating the critic NN weights. The proposed algorithm is implemented on actor-critic-disturbance neural network (NN) structure, where all NNs are tuned at the same time to obtain the solution of the Hamilton-Jacobi-Isaacs (HJI) equation, without requiring the information on the internal system dynamics. The stability of the closed-loop system is guaranteed and the convergence to the optimal solution is obtained. Simulation results show the effectiveness of the proposed method.
Keywords :
H∞ control; adaptive control; closed loop systems; continuous time systems; convergence of numerical methods; iterative methods; neurocontrollers; nonlinear control systems; stability; CT systems; H∞ control problem; HJI equation; Hamilton-Jacobi-Isaacs equation; PE condition; PI algorithm; actor-critic-disturbance neural network structure; closed-loop system stability; critic NN weight updating; experience replay; internal system dynamics; online adaptive optimal control algorithm; partially unknown nonlinear continuous-time systems; policy iteration algorithm; restrictive persistency of excitation condition; simplified rank condition; Approximation methods; Artificial neural networks; Convergence; Equations; Games; Mathematical model; Optimal control;
Conference_Titel :
Control Conference (ECC), 2014 European
Conference_Location :
Strasbourg
Print_ISBN :
978-3-9524269-1-3
DOI :
10.1109/ECC.2014.6862229