Title :
Direct heuristic dynamic programming with augmented states
Author :
Sun, Jian ; Liu, Feng ; Si, Jennie ; MEI, Shengwei
Author_Institution :
Dept. of Electr. Eng., Tsinghua Univ., Beijing, China
fDate :
July 31 2011-Aug. 5 2011
Abstract :
This paper addresses a design issue of an approximate dynamic programming structure and its respective convergence property. Specifically, we propose to impose a PID structure to the action and critic networks in the direct heuristic dynamic programming (direct HDP) online learning controller. We demonstrate that the direct HDP with such PID augmented states improves convergence speed and that it out performs the traditional PID even though the learning controller may be initialized to be like a PID. Also for the first time, by using a Lyapnov approach we show that the action and critic network weights retain the property of uniformly ultimate boundedness (UUB) under mild conditions.
Keywords :
Lyapunov methods; adaptive control; convergence; dynamic programming; learning systems; three-term control; Lyapnov approach; PID structure; action network; augmented states; convergence property; critic network; direct heuristic dynamic programming; online learning controller; uniformly ultimate boundedness; Convergence; Dynamic programming; Equations; Function approximation; Mathematical model; Optimal control; Approximate Dynamic Programming (ADP); Direct Heuristic Dynamic Programming (direct HDP); Feedforward Neural Network with Augmented states (AFNN);
Conference_Titel :
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-9635-8
DOI :
10.1109/IJCNN.2011.6033633