Policy iteration approximate dynamic programming using Volterra series based actor

Author

Wentao Guo ; Si, Jennie ; Feng Liu ; Shengwei Mei

Author_Institution

Dept. of Electr. Eng., Tsinghua Univ., Beijing, China

fYear

2014

fDate

6-11 July 2014

Firstpage

249

Lastpage

255

Abstract

There is an extensive literature on value function approximation for approximate dynamic programming (ADP). Multilayer perceptrons (MLPs) and radial basis functions (RBFs), among others, are typical approximators for value functions in ADP. Similar approaches have been taken for policy approximation. In this paper, we propose a new Volterra series based structure for actor approximation in ADP. The Volterra approx-imator is linear in parameters with global optima attainable. Given the proposed approximator structures, we further develop a policy iteration framework under which a gradient descent training algorithm for obtaining the optimal Volterra kernels can be obtained. Associated with this ADP design, we provide a sufficient condition based on actor approximation error to guarantee convergence of the value function iterations. A finite bound of the final convergent value function is also given. Finally, by using a simulation example we illustrate the effectiveness of the proposed Volterra actor for optimal control of a nonlinear system.

Keywords

Volterra series; dynamic programming; multilayer perceptrons; radial basis function networks; ADP design; MLP; RBF; Volterra actor; Volterra approximator; Volterra series based actor; Volterra series based structure; actor approximation error; approximator structures; finite bound; gradient descent training algorithm; multilayer perceptrons; nonlinear system; optimal Volterra kernels; optimal control; policy approximation; policy iteration approximate dynamic programming; policy iteration framework; radial basis functions; value function approximation; value function iterations; Approximation algorithms; Approximation error; Convergence; Function approximation; Kernel; Optimal control;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks (IJCNN), 2014 International Joint Conference on

Conference_Location

Beijing

Print_ISBN

978-1-4799-6627-1

Type

conf

DOI

10.1109/IJCNN.2014.6889865

Filename

6889865