DocumentCode
1575934
Title
Reinforcement learning of impedance control in stochastic force fields
Author
Stulp, Freek ; Buchli, Jonas ; Ellmer, Alice ; Mistry, Michael ; Theodorou, Evangelos ; Schaal, Stefan
Author_Institution
Comput. Learning & Motor Control Lab., Univ. of Southern California, Los Angeles, CA, USA
Volume
2
fYear
2011
Firstpage
1
Lastpage
6
Abstract
Variable impedance control is essential for ensuring robust and safe physical interaction with the environment. As demonstrated in numerous force field experiments, humans combine two strategies to adapt their impedance to external perturbations: 1) if perturbations are unpredictable, subjects increase their impedance through co-contraction; 2) if perturbations are predictable, subjects learn a feed-forward command to counter the known perturbation. In this paper, we apply the force field paradigm to a simulated 7-DOF robot, by exerting stochastic forces on the robot´s end-effector. The robot `subject´ uses our model-free reinforcement learning algorithm PI2 to simultaneously learn the end-effector trajectories and variable impedance schedules. We demonstrate how the robot learns the same two-fold strategy to perturbation rejection as humans do, resulting in qualitatively similar behavior. Our results provide a biologically plausible approach to learning appropriate impedances purely from experience, without requiring a model of either body or environment dynamics.
Keywords
end effectors; feedforward; learning systems; stochastic systems; biologically plausible approach; end-effector trajectories; external perturbations; feedforward command; impedance perturbations; model-free reinforcement learning algorithm; physical interaction; robot end-effector; simulated 7-DOF robot; stochastic force fields; two-fold strategy; variable impedance control; Computational modeling; Noise measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Development and Learning (ICDL), 2011 IEEE International Conference on
Conference_Location
Frankfurt am Main
ISSN
2161-9476
Print_ISBN
978-1-61284-989-8
Type
conf
DOI
10.1109/DEVLRN.2011.6037312
Filename
6037312
Link To Document