Regional Information Center for Science and Technology - A novel actor–critic–identifier architecture for approximate optimal control of uncertain nonlinear systems

Title of article :

A novel actor–critic–identifier architecture for approximate optimal control of uncertain nonlinear systems

Author/Authors :

Bhasin, S. and Kamalapurkar, R. and Johnson, M. and Vamvoudakis, K.G. and Lewis, F.L. and Dixon, W.E.

Issue Information :

Journal with serial number, year 2013

Pages :

From page :

To page :

Abstract :

An online adaptive reinforcement learning-based solution is developed for the infinite-horizon optimal control problem for continuous-time uncertain nonlinear systems. A novel actor–critic–identifier (ACI) is proposed to approximate the Hamilton–Jacobi–Bellman equation using three neural network (NN) structures—actor and critic NNs approximate the optimal control and the optimal value function, respectively, and a robust dynamic neural network identifier asymptotically approximates the uncertain system dynamics. An advantage of using the ACI architecture is that learning by the actor, critic, and identifier is continuous and simultaneous, without requiring knowledge of system drift dynamics. Convergence of the algorithm is analyzed using Lyapunov-based adaptive control methods. A persistence of excitation condition is required to guarantee exponential convergence to a bounded region in the neighborhood of the optimal control and uniformly ultimately bounded (UUB) stability of the closed-loop system. Simulation results demonstrate the performance of the actor–critic–identifier method for approximate optimal control.
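
For orientation, the problem class named in the abstract can be written in the standard control-affine form below. This is a minimal sketch under assumed notation, not text from this record: the symbols f, g, Q, R, V^* and u^* are assumptions introduced here for illustration, with f the unknown drift dynamics, g the input gain, and \nabla V^* taken as a row vector.

```latex
% Assumed control-affine dynamics (f is the uncertain drift term)
\dot{x} = f(x) + g(x)\,u

% Assumed infinite-horizon cost, with Q(x) \ge 0 and R = R^{\top} > 0
V(x_0) = \int_{0}^{\infty} \bigl( Q(x) + u^{\top} R\, u \bigr)\, dt

% Hamilton--Jacobi--Bellman equation for the optimal value function V^*
0 = \min_{u} \Bigl[ \nabla V^{*}(x)\bigl( f(x) + g(x)\,u \bigr) + Q(x) + u^{\top} R\, u \Bigr]

% Minimizer: setting the derivative with respect to u to zero,
% \nabla V^{*} g + 2 u^{\top} R = 0, gives
u^{*}(x) = -\tfrac{1}{2}\, R^{-1} g^{\top}(x)\, \nabla V^{*}(x)^{\top}
```

In this notation, the critic NN would approximate V^*, the actor NN would approximate u^*, and the identifier would supply an estimate of the dynamics in place of the unknown f.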

Keywords :

Optimal control, Approximate dynamic programming, Learning control, Actor–critic–identifier, Adaptive control

Journal title :

Automatica

Serial Year :

2013

Record number :

1448968

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1448968