• DocumentCode
    1280550
  • Title

    Reinforcement Learning With Sequences of Motion Primitives for Robust Manipulation

  • Author

    Stulp, Freek ; Theodorou, Evangelos A. ; Schaal, Stefan

  • Author_Institution
    Comput. Learning & Motor Control Lab., Univ. of Southern California (USC), Los Angeles, CA, USA
  • Volume
    28
  • Issue
    6
  • fYear
    2012
  • Firstpage
    1360
  • Lastpage
    1370
  • Abstract
    Physical contact events often allow a natural decomposition of manipulation tasks into action phases and subgoals. Within the motion primitive paradigm, each action phase corresponds to a motion primitive, and the subgoals correspond to the goal parameters of these primitives. Current state-of-the-art reinforcement learning algorithms are able to efficiently and robustly optimize the parameters of motion primitives in very high-dimensional problems. These algorithms often consider only shape parameters, which determine the trajectory between the start- and end-point of the movement. In manipulation, however, it is also crucial to optimize the goal parameters, which represent the subgoals between the motion primitives. We therefore extend the policy improvement with path integrals (PI2) algorithm to simultaneously optimize shape and goal parameters. Applying simultaneous shape and goal learning to sequences of motion primitives leads to the novel algorithm PI2 Seq. We use our methods to address a fundamental challenge in manipulation: improving the robustness of everyday pick-and-place tasks.
  • Keywords
    control engineering computing; learning (artificial intelligence); manipulators; motion control; trajectory control; PI2 algorithm; manipulation tasks; motion primitive sequences; movement trajectory; path integrals; physical contact events; pick-and-place tasks; policy improvement; reinforcement learning; robust manipulation; Adaptive systems; Grasping; Learning; Learning systems; Manipulators; Learning and adaptive systems; manipulation planning; reinforcement learning;
  • fLanguage
    English
  • Journal_Title
    Robotics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1552-3098
  • Type

    jour

  • DOI
    10.1109/TRO.2012.2210294
  • Filename
    6295672