DocumentCode
1798260
Title
Policy iteration approximate dynamic programming using Volterra series based actor
Author
Wentao Guo ; Si, Jennie ; Feng Liu ; Shengwei Mei
Author_Institution
Dept. of Electr. Eng., Tsinghua Univ., Beijing, China
fYear
2014
fDate
6-11 July 2014
Firstpage
249
Lastpage
255
Abstract
There is an extensive literature on value function approximation for approximate dynamic programming (ADP). Multilayer perceptrons (MLPs) and radial basis functions (RBFs), among others, are typical approximators for value functions in ADP. Similar approaches have been taken for policy approximation. In this paper, we propose a new Volterra series based structure for actor approximation in ADP. The Volterra approx-imator is linear in parameters with global optima attainable. Given the proposed approximator structures, we further develop a policy iteration framework under which a gradient descent training algorithm for obtaining the optimal Volterra kernels can be obtained. Associated with this ADP design, we provide a sufficient condition based on actor approximation error to guarantee convergence of the value function iterations. A finite bound of the final convergent value function is also given. Finally, by using a simulation example we illustrate the effectiveness of the proposed Volterra actor for optimal control of a nonlinear system.
Keywords
Volterra series; dynamic programming; multilayer perceptrons; radial basis function networks; ADP design; MLP; RBF; Volterra actor; Volterra approximator; Volterra series based actor; Volterra series based structure; actor approximation error; approximator structures; finite bound; gradient descent training algorithm; multilayer perceptrons; nonlinear system; optimal Volterra kernels; optimal control; policy approximation; policy iteration approximate dynamic programming; policy iteration framework; radial basis functions; value function approximation; value function iterations; Approximation algorithms; Approximation error; Convergence; Function approximation; Kernel; Optimal control;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6627-1
Type
conf
DOI
10.1109/IJCNN.2014.6889865
Filename
6889865
Link To Document