DocumentCode :
117477
Title :
Robust policy updates for stochastic optimal control
Author :
Rueckert, Elmar ; Mindt, Max ; Peters, Jan ; Neumann, Gerhard
Author_Institution :
Intell. Autonomous Syst. Lab., Tech. Univ. Darmstadt, Darmstadt, Germany
fYear :
2014
fDate :
18-20 Nov. 2014
Firstpage :
388
Lastpage :
393
Abstract :
For controlling high-dimensional robots, most stochastic optimal control algorithms use approximations of the system dynamics and of the cost function (e.g., using linearizations and Taylor expansions). These approximations are typically only locally correct, which might cause instabilities in the greedy policy updates, lead to oscillations or the algorithms diverge. To overcome these drawbacks, we add a regularization term to the cost function that punishes large policy update steps in the trajectory optimization procedure. We applied this concept to the Approximate Inference Control method (AICO), where the resulting algorithm guarantees convergence for uninformative initial solutions without complex hand-tuning of learning rates. We evaluated our new algorithm on two simulated robotic platforms. A robot arm with five joints was used for reaching multiple targets while keeping the roll angle constant. On the humanoid robot Nao, we show how complex skills like reaching and balancing can be inferred from desired center of gravity or end effector coordinates.
Keywords :
approximation theory; end effectors; humanoid robots; optimal control; robot dynamics; stochastic systems; AICO; Nao humanoid robot; approximate inference control method; cost function; end effector; greedy policy updates; high-dimensional robots; regularization term; robot arm; robust policy updates; stochastic optimal control; system dynamics approximations; trajectory optimization procedure; uninformative initial solutions; Approximation algorithms; Approximation methods; Gravity; Planning; Robot kinematics; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Humanoid Robots (Humanoids), 2014 14th IEEE-RAS International Conference on
Conference_Location :
Madrid
Type :
conf
DOI :
10.1109/HUMANOIDS.2014.7041389
Filename :
7041389
Link To Document :
بازگشت