• DocumentCode
    3178864
  • Title

    Policy Gradient Methods for Robotics

  • Author

    Peters, Jan ; Schaal, Stefan

  • Author_Institution
    Univ. of Southern California, Los Angeles, CA
  • fYear
    2006
  • fDate
    Oct. 2006
  • Firstpage
    2219
  • Lastpage
    2225
  • Abstract
    The acquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-structured environments. However, to date only few existing reinforcement learning methods have been scaled into the domains of high-dimensional robots such as manipulator, legged or humanoid robots. Policy gradient methods remain one of the few exceptions and have found a variety of applications. Nevertheless, the application of such methods is not without peril if done in an uninformed manner. In this paper, we give an overview on learning with policy gradient methods for robotics with a strong focus on recent advances in the field. We outline previous applications to robotics and show how the most recently developed methods can significantly improve learning performance. Finally, we evaluate our most promising algorithm in the application of hitting a baseball with an anthropomorphic arm
  • Keywords
    gradient methods; humanoid robots; learning (artificial intelligence); legged locomotion; manipulators; anthropomorphic arm; humanoid robots; legged robots; manipulator; motor skills; policy gradient method; reinforcement learning; Anthropomorphism; Gradient methods; Humanoid robots; Intelligent robots; Learning; Optimal control; Orbital robotics; Probability distribution; Robot control; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-0259-X
  • Electronic_ISBN
    1-4244-0259-X
  • Type

    conf

  • DOI
    10.1109/IROS.2006.282564
  • Filename
    4058714