DocumentCode :
3583880
Title :
Approximate dynamic programming via sum of squares programming
Author :
Summers, Tyler H. ; Kunz, Konstantin ; Kariotoglou, Nikolaos ; Kamgarpour, Maryam ; Summers, Sean ; Lygeros, John
Author_Institution :
ETH Zurich, Zurich, Switzerland
fYear :
2013
Firstpage :
191
Lastpage :
197
Abstract :
We describe an approximate dynamic programming method for stochastic control problems on infinite state and input spaces. The optimal value function is approximated by a linear combination of basis functions with coefficients as decision variables. By relaxing the Bellman equation to an inequality, one obtains a linear program in the basis coefficients with an infinite set of constraints. We show that a recently introduced method, which obtains convex quadratic value function approximations, can be extended to higher order polynomial approximations via sum of squares programming techniques. An approximate value function can then be computed offline by solving a semidefinite program, without having to sample the infinite constraint. The policy is evaluated online by solving a polynomial optimization problem, which also turns out to be convex in some cases. We experimentally validate the method on an autonomous helicopter testbed using a 10-dimensional helicopter model.
Keywords :
approximation theory; dynamic programming; function approximation; polynomial approximation; stochastic systems; 10-dimensional helicopter model; Bellman equation; approximate dynamic programming; approximate value function; autonomous helicopter testbed; convex quadratic value function approximations; higher order polynomial approximations; infinite state space; input space; linear program; optimal value function; polynomial optimization problem; stochastic control problems; sum of squares programming technique; Dynamic programming; Function approximation; Helicopters; Optimization; Polynomials; Programming;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (ECC), 2013 European
Type :
conf
Filename :
6669374
Link To Document :
بازگشت