• DocumentCode
    2765727
  • Title
    Reinforcement Learning for Parameterized Motor Primitives
  • Author
    Peters, Jan ; Schaal, Stefan
  • Author_Institution
    Department of Computer Science, University of Southern California, Los Angeles, CA 90089, USA. phone: 213-740-6717; fax: 213-740-1510; email: jrpeters@usc.edu
  • fYear
    2006
  • fDate
    16-21 July 2006
  • Firstpage
    73
  • Lastpage
    80
  • Abstract
    One of the major challenges in both action generation for robotics and in the understanding of human motor control is to learn the "building blocks of movement generation", called motor primitives. Motor primitives, as used in this paper, are parameterized control policies such as splines or nonlinear differential equations with desired attractor properties. While a lot of progress has been made in teaching parameterized motor primitives using supervised or imitation learning, self-improvement through interaction of the system with the environment remains a challenging problem. In this paper, we evaluate different reinforcement learning approaches for improving the performance of parameterized motor primitives. To pursue this goal, we highlight the difficulties with current reinforcement learning methods, and outline both established and novel algorithms for the gradient-based improvement of parameterized policies. We compare these algorithms in the context of motor primitive learning, and show that our most modern algorithm, the Episodic Natural Actor-Critic, outperforms previous algorithms by at least an order of magnitude. We demonstrate the efficiency of this reinforcement learning method in the application of learning to hit a baseball with an anthropomorphic robot arm.
  • Keywords
    Anthropomorphism; Differential equations; Education; Humanoid robots; Humans; Learning; Mechanical splines; Motor drives; Robot kinematics; Trajectory;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Networks, 2006. IJCNN '06. International Joint Conference on
  • Print_ISBN
    0-7803-9490-9
  • Type
    conf
  • DOI
    10.1109/IJCNN.2006.246662
  • Filename
    1716073