DocumentCode :
2765727
Title :
Reinforcement Learning for Parameterized Motor Primitives
Author :
Peters, Jan ; Schaal, Stefan
Author_Institution :
Department of Computer Science, University of Southern California, Los Angeles, CA 90089, USA. phone: 213-740-6717; fax: 213-740-1510; email: jrpeters@usc.edu
fYear :
2006
fDate :
16-21 July 2006
Firstpage :
73
Lastpage :
80
Abstract :
One of the major challenges in both action generation for robotics and in the understanding of human motor control is to learn the "building blocks of movement generation", called motor primitives. Motor primitives, as used in this paper, are parameterized control policies such as splines or nonlinear differential equations with desired attractor properties. While a lot of progress has been made in teaching parameterized motor primitives using supervised or imitation learning, the selfimprovement by interaction of the system with the environment remains a challenging problem. In this paper, we evaluate different reinforcement learning approaches for improving the performance of parameterized motor primitives. For pursuing this goal, we highlight the difficulties with current reinforcement learning methods, and outline both established and novel algorithms for the gradient-based improvement of parameterized policies. We compare these algorithms in the context of motor primitive learning, and show that our most modern algorithm, the Episodic Natural Actor-Critic outperforms previous algorithms by at least an order of magnitude. We demonstrate the efficiency of this reinforcement learning method in the application of learning to hit a baseball with an anthropomorphic robot arm.
Keywords :
Anthropomorphism; Differential equations; Education; Humanoid robots; Humans; Learning; Mechanical splines; Motor drives; Robot kinematics; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Print_ISBN :
0-7803-9490-9
Type :
conf
DOI :
10.1109/IJCNN.2006.246662
Filename :
1716073
Link To Document :
بازگشت