• DocumentCode
    2383301
  • Title

    A comparison of two algorithms for robot learning from demonstration

  • Author

    Suay, Halit Bener ; Chernova, Sonia

  • Author_Institution
    Robot. Eng. Program, Worcester Polytech. Inst., Worcester, MA, USA
  • fYear
    2011
  • fDate
    9-12 Oct. 2011
  • Firstpage
    2495
  • Lastpage
    2500
  • Abstract
    Robot learning from demonstration focuses on algorithms that enable a robot to learn a policy from demonstrations performed by a teacher, typically a human expert. This paper presents an experimental evaluation of two learning from demonstration algorithms, Interactive Reinforcement Learning and Behavior Networks. We evaluate the performance of these algorithms using a humanoid robot and discuss the relative advantages and drawbacks of these methods with respect to learning time, number of demonstrations, ease of implementation and other metrics. Our results show that Behavior Networks rely on a greater degree of domain knowledge and programmer expertise, requiring very precise definitions for behavior pre- and post-conditions. By contrast Interactive RL requires a relatively simple implementation based only on the robot´s sensor data and actions. However, Behavior Networks leverage the pre-coded knowledge to effectively reduce learning time and the required number of human interactions to learn the task.
  • Keywords
    human-robot interaction; humanoid robots; learning by example; behavior networks; human interactions; humanoid robot; interactive reinforcement learning; robot learning from demonstration; Actuators; Humans; Knowledge engineering; Learning; Robot sensing systems; Strontium; Learning and Adaptive Systems; Personal Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4577-0652-3
  • Type

    conf

  • DOI
    10.1109/ICSMC.2011.6084052
  • Filename
    6084052