• DocumentCode
    2615346
  • Title

    Hierarchical reinforcement learning with movement primitives

  • Author

    Stulp, Freek ; Schaal, Stefan

  • Author_Institution
    Comput. Learning & Motor Control Lab., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2011
  • fDate
    26-28 Oct. 2011
  • Firstpage
    231
  • Lastpage
    238
  • Abstract
    Temporal abstraction and task decomposition drastically reduce the search space for planning and control, and are fundamental to making complex tasks amenable to learning. In the context of reinforcement learning, temporal abstractions are studied within the paradigm of hierarchical reinforcement learning. We propose a hierarchical reinforcement learning approach by applying our algorithm PI2 to sequences of Dynamic Movement Primitives. For robots, this representation has some important advantages over discrete representations in terms of scalability and convergence speed. The parameters of the Dynamic Movement Primitives are learned simultaneously at different levels of temporal abstraction. The shape of a movement primitive is optimized w.r.t. the costs up to the next primitive in the sequence, and the subgoals between two movement primitives w.r.t. the costs up to the end of the entire movement primitive sequence. We implement our approach on an 11-DOF arm and hand, and evaluate it in a pick-and-place task in which the robot transports an object between different shelves in a cupboard.
  • Keywords
    learning (artificial intelligence); optimisation; robots; 11-DOF arm; PI2 algorithm; discrete representation; dynamic movement primitive sequence; hierarchical reinforcement learning; optimization; pick-and-place task; task decomposition; temporal abstraction; Aerospace electronics; Cost function; Learning; Noise; Robots; Shape; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Humanoid Robots (Humanoids), 2011 11th IEEE-RAS International Conference on
  • Conference_Location
    Bled
  • ISSN
    2164-0572
  • Print_ISBN
    978-1-61284-866-2
  • Electronic_ISBN
    2164-0572
  • Type

    conf

  • DOI
    10.1109/Humanoids.2011.6100841
  • Filename
    6100841