DocumentCode :
2497928
Title :
High-order local dynamic programming
Author :
Tassa, Yuval ; Todorov, Emanuel
Author_Institution :
Interdiscipl. Center for Neural Comput., Hebrew Univ., Jerusalem, Israel
fYear :
2011
fDate :
11-15 April 2011
Firstpage :
70
Lastpage :
75
Abstract :
We describe a new local dynamic programming algorithm for solving stochastic continuous Optimal Control problems. We use cubature integration to both propagate the state distribution and perform the Bellman backup. The algorithm can approximate the local policy and cost-to-go with arbitrary function bases. We compare the classic quadratic cost-to-go/linear-feedback controller to a cubic cost-to-go/quadratic policy controller on a 10-dimensional simulated swimming robot, and find that the higher order approximation yields a more general policy with a larger basin of attraction.
Keywords :
continuous systems; dynamic programming; feedback; linear systems; mobile robots; optimal control; stochastic systems; 10-dimensional simulated swimming robot; Bellman backup; arbitrary function bases; cubature integration; cubic cost-to-go controller; high-order local dynamic programming; linear-feedback controller; quadratic cost-to-go controller; quadratic policy controller; state distribution; stochastic continuous optimal control problem; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Mathematical model; Noise; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9887-1
Type :
conf
DOI :
10.1109/ADPRL.2011.5967350
Filename :
5967350
Link To Document :
بازگشت