Title :
High-order local dynamic programming
Author :
Tassa, Yuval ; Todorov, Emanuel
Author_Institution :
Interdiscipl. Center for Neural Comput., Hebrew Univ., Jerusalem, Israel
Abstract :
We describe a new local dynamic programming algorithm for solving stochastic continuous Optimal Control problems. We use cubature integration to both propagate the state distribution and perform the Bellman backup. The algorithm can approximate the local policy and cost-to-go with arbitrary function bases. We compare the classic quadratic cost-to-go/linear-feedback controller to a cubic cost-to-go/quadratic policy controller on a 10-dimensional simulated swimming robot, and find that the higher order approximation yields a more general policy with a larger basin of attraction.
Keywords :
continuous systems; dynamic programming; feedback; linear systems; mobile robots; optimal control; stochastic systems; 10-dimensional simulated swimming robot; Bellman backup; arbitrary function bases; cubature integration; cubic cost-to-go controller; high-order local dynamic programming; linear-feedback controller; quadratic cost-to-go controller; quadratic policy controller; state distribution; stochastic continuous optimal control problem; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Mathematical model; Noise; Trajectory;
Conference_Titel :
Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9887-1
DOI :
10.1109/ADPRL.2011.5967350