• DocumentCode
    3560958
  • Title

    Imitation and Reinforcement Learning

  • Author

    Kober, Jens ; Peters, Jan

  • Author_Institution
    Dept. of Empirical Inference & Machine Learning, Max Planck Inst. for Biol. Cybern., Tübingen, Germany
  • Volume
    17
  • Issue
    2
  • fYear
    2010
  • fDate
    6/1/2010 12:00:00 AM
  • Firstpage
    55
  • Lastpage
    62
  • Abstract
    In this article, we present both novel learning algorithms and experiments using the dynamical system MPs. As such, we describe this MP representation in a way that it is straightforward to reproduce. We review an appropriate imitation learning method, i.e., locally weighted regression, and show how this method can be used both for initializing RL tasks as well as for modifying the start-up phase in a rhythmic task. We also show our current best-suited RL algorithm for this framework, i.e., PoWER. We present two complex motor tasks, i.e., ball-in-a-cup and ball paddling, learned on a real, physical Barrett WAM, using the methods presented in this article. Of particular interest is the ball-paddling application, as it requires a combination of both rhythmic and discrete dynamical systems MPs during the start-up phase to achieve a particular task.
  • Keywords
    discrete systems; learning (artificial intelligence); regression analysis; robots; ball paddling; ball-in-a-cup; discrete dynamical system; imitation learning; industrial robots; motor primitive; reinforcement learning; weighted regression; whole arm manipulator; Anthropomorphism; Humans; Intelligent robots; Learning systems; Legged locomotion; Manufacturing; Planar motors; Robot programming; Service robots; Stability;
  • fLanguage
    English
  • Journal_Title
    Robotics Automation Magazine, IEEE
  • Publisher
    ieee
  • Conference_Location
    6/1/2010 12:00:00 AM
  • ISSN
    1070-9932
  • Type

    jour

  • DOI
    10.1109/MRA.2010.936952
  • Filename
    5480345