• DocumentCode
    2497928
  • Title

    High-order local dynamic programming

  • Author

    Tassa, Yuval ; Todorov, Emanuel

  • Author_Institution
    Interdiscipl. Center for Neural Comput., Hebrew Univ., Jerusalem, Israel
  • fYear
    2011
  • fDate
    11-15 April 2011
  • Firstpage
    70
  • Lastpage
    75
  • Abstract
    We describe a new local dynamic programming algorithm for solving stochastic continuous Optimal Control problems. We use cubature integration to both propagate the state distribution and perform the Bellman backup. The algorithm can approximate the local policy and cost-to-go with arbitrary function bases. We compare the classic quadratic cost-to-go/linear-feedback controller to a cubic cost-to-go/quadratic policy controller on a 10-dimensional simulated swimming robot, and find that the higher order approximation yields a more general policy with a larger basin of attraction.
  • Keywords
    continuous systems; dynamic programming; feedback; linear systems; mobile robots; optimal control; stochastic systems; 10-dimensional simulated swimming robot; Bellman backup; arbitrary function bases; cubature integration; cubic cost-to-go controller; high-order local dynamic programming; linear-feedback controller; quadratic cost-to-go controller; quadratic policy controller; state distribution; stochastic continuous optimal control problem; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Mathematical model; Noise; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4244-9887-1
  • Type

    conf

  • DOI
    10.1109/ADPRL.2011.5967350
  • Filename
    5967350