• DocumentCode
    2615101
  • Title

    Model-free apprenticeship learning for transfer of human impedance behaviour

  • Author

    Mori, Takeshi ; Howard, Matthew ; Vijayakumar, Sethu

  • Author_Institution
    Sch. of Inf., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2011
  • fDate
    26-28 Oct. 2011
  • Firstpage
    239
  • Lastpage
    246
  • Abstract
    We present a method for transferring behaviour from humans to robots via apprenticeship learning. While previous methods have relied on an accurate model of the demonstrator´s dynamics, in most practical settings such models fail to capture (i) complex, non-linear dynamics of the hu- man musculoskeletal system, and (ii) inconsistencies between modelling assumptions and the configuration and placement of measurement apparatus. To avoid such issues, we propose a model-free approach to apprenticeship learning, in which off- policy, model-free reinforcement learning techniques are used to extract a model of the objective function optimised in human behaviour. As a key ingredient, we derive a novel formulation of Least Squares Policy Iteration (LSPI) and Least Squares Temporal Difference learning (LSTD) to enable their application in this setting. The robustness of our approach is demonstrated in experiments where human hitting behaviour is transferred to a non-biomorphic robotic device.
  • Keywords
    human-robot interaction; intelligent robots; learning (artificial intelligence); least squares approximations; muscle; nonlinear control systems; robot dynamics; social sciences; demonstrator dynamics; human behaviour; human hitting behaviour; human impedance behaviour; human musculoskeletal system; least squares policy iteration; least squares temporal difference learning; model-free apprenticeship learning; model-free reinforcement learning technique; nonbiomorphic robotic device; nonlinear dynamics; objective function; Computational modeling; Data models; Humans; Muscles; Optimization; Robots; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Humanoid Robots (Humanoids), 2011 11th IEEE-RAS International Conference on
  • Conference_Location
    Bled
  • ISSN
    2164-0572
  • Print_ISBN
    978-1-61284-866-2
  • Electronic_ISBN
    2164-0572
  • Type

    conf

  • DOI
    10.1109/Humanoids.2011.6100830
  • Filename
    6100830