DocumentCode
2497928
Title
High-order local dynamic programming
Author
Tassa, Yuval ; Todorov, Emanuel
Author_Institution
Interdiscipl. Center for Neural Comput., Hebrew Univ., Jerusalem, Israel
fYear
2011
fDate
11-15 April 2011
Firstpage
70
Lastpage
75
Abstract
We describe a new local dynamic programming algorithm for solving stochastic continuous Optimal Control problems. We use cubature integration to both propagate the state distribution and perform the Bellman backup. The algorithm can approximate the local policy and cost-to-go with arbitrary function bases. We compare the classic quadratic cost-to-go/linear-feedback controller to a cubic cost-to-go/quadratic policy controller on a 10-dimensional simulated swimming robot, and find that the higher order approximation yields a more general policy with a larger basin of attraction.
Keywords
continuous systems; dynamic programming; feedback; linear systems; mobile robots; optimal control; stochastic systems; 10-dimensional simulated swimming robot; Bellman backup; arbitrary function bases; cubature integration; cubic cost-to-go controller; high-order local dynamic programming; linear-feedback controller; quadratic cost-to-go controller; quadratic policy controller; state distribution; stochastic continuous optimal control problem; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Mathematical model; Noise; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
Conference_Location
Paris
Print_ISBN
978-1-4244-9887-1
Type
conf
DOI
10.1109/ADPRL.2011.5967350
Filename
5967350
Link To Document