High-order local dynamic programming

Author

Tassa, Yuval ; Todorov, Emanuel

Author_Institution

Interdiscipl. Center for Neural Comput., Hebrew Univ., Jerusalem, Israel

fYear

2011

fDate

11-15 April 2011

Firstpage

70

Lastpage

75

Abstract

We describe a new local dynamic programming algorithm for solving stochastic continuous Optimal Control problems. We use cubature integration to both propagate the state distribution and perform the Bellman backup. The algorithm can approximate the local policy and cost-to-go with arbitrary function bases. We compare the classic quadratic cost-to-go/linear-feedback controller to a cubic cost-to-go/quadratic policy controller on a 10-dimensional simulated swimming robot, and find that the higher order approximation yields a more general policy with a larger basin of attraction.

Keywords

continuous systems; dynamic programming; feedback; linear systems; mobile robots; optimal control; stochastic systems; 10-dimensional simulated swimming robot; Bellman backup; arbitrary function bases; cubature integration; cubic cost-to-go controller; high-order local dynamic programming; linear-feedback controller; quadratic cost-to-go controller; quadratic policy controller; state distribution; stochastic continuous optimal control problem; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Mathematical model; Noise; Trajectory;

fLanguage

English

Publisher

ieee

Conference_Titel

Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on

Conference_Location

Paris

Print_ISBN

978-1-4244-9887-1

Type

conf

DOI

10.1109/ADPRL.2011.5967350

Filename

5967350