• DocumentCode
    117477
  • Title

    Robust policy updates for stochastic optimal control

  • Author

    Rueckert, Elmar ; Mindt, Max ; Peters, Jan ; Neumann, Gerhard

  • Author_Institution
    Intell. Autonomous Syst. Lab., Tech. Univ. Darmstadt, Darmstadt, Germany
  • fYear
    2014
  • fDate
    18-20 Nov. 2014
  • Firstpage
    388
  • Lastpage
    393
  • Abstract
    For controlling high-dimensional robots, most stochastic optimal control algorithms use approximations of the system dynamics and of the cost function (e.g., using linearizations and Taylor expansions). These approximations are typically only locally correct, which might cause instabilities in the greedy policy updates, lead to oscillations or the algorithms diverge. To overcome these drawbacks, we add a regularization term to the cost function that punishes large policy update steps in the trajectory optimization procedure. We applied this concept to the Approximate Inference Control method (AICO), where the resulting algorithm guarantees convergence for uninformative initial solutions without complex hand-tuning of learning rates. We evaluated our new algorithm on two simulated robotic platforms. A robot arm with five joints was used for reaching multiple targets while keeping the roll angle constant. On the humanoid robot Nao, we show how complex skills like reaching and balancing can be inferred from desired center of gravity or end effector coordinates.
  • Keywords
    approximation theory; end effectors; humanoid robots; optimal control; robot dynamics; stochastic systems; AICO; Nao humanoid robot; approximate inference control method; cost function; end effector; greedy policy updates; high-dimensional robots; regularization term; robot arm; robust policy updates; stochastic optimal control; system dynamics approximations; trajectory optimization procedure; uninformative initial solutions; Approximation algorithms; Approximation methods; Gravity; Planning; Robot kinematics; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Humanoid Robots (Humanoids), 2014 14th IEEE-RAS International Conference on
  • Conference_Location
    Madrid
  • Type

    conf

  • DOI
    10.1109/HUMANOIDS.2014.7041389
  • Filename
    7041389