Direct heuristic dynamic programming with augmented states

Author

Sun, Jian ; Liu, Feng ; Si, Jennie ; MEI, Shengwei

Author_Institution

Dept. of Electr. Eng., Tsinghua Univ., Beijing, China

fYear

2011

fDate

July 31 2011-Aug. 5 2011

Firstpage

3112

Lastpage

3119

Abstract

This paper addresses a design issue of an approximate dynamic programming structure and its respective convergence property. Specifically, we propose to impose a PID structure to the action and critic networks in the direct heuristic dynamic programming (direct HDP) online learning controller. We demonstrate that the direct HDP with such PID augmented states improves convergence speed and that it out performs the traditional PID even though the learning controller may be initialized to be like a PID. Also for the first time, by using a Lyapnov approach we show that the action and critic network weights retain the property of uniformly ultimate boundedness (UUB) under mild conditions.

Keywords

Lyapunov methods; adaptive control; convergence; dynamic programming; learning systems; three-term control; Lyapnov approach; PID structure; action network; augmented states; convergence property; critic network; direct heuristic dynamic programming; online learning controller; uniformly ultimate boundedness; Convergence; Dynamic programming; Equations; Function approximation; Mathematical model; Optimal control; Approximate Dynamic Programming (ADP); Direct Heuristic Dynamic Programming (direct HDP); Feedforward Neural Network with Augmented states (AFNN);

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks (IJCNN), The 2011 International Joint Conference on

Conference_Location

San Jose, CA

ISSN

2161-4393

Print_ISBN

978-1-4244-9635-8

Type

conf

DOI

10.1109/IJCNN.2011.6033633

Filename

6033633