• DocumentCode
    117531
  • Title

    Efficient reuse of previous experiences in humanoid motor learning

  • Author

    Sugimoto, Norikazu ; Tangkaratt, Voot ; Wensveen, Thijs ; Tingting Zhao ; Sugiyama, Masashi ; Morimoto, Jun

  • Author_Institution
    Dept. of Center for Inf. & Neural Networks, Nat. Inst. of Inf. & Commun. Technol., Kobe, Japan
  • fYear
    2014
  • fDate
    18-20 Nov. 2014
  • Firstpage
    554
  • Lastpage
    559
  • Abstract
    In this study, we show that the motor control performance of a humanoid robot can be improved efficiently using its previous experiences in a Reinforcement Learning (RL) framework. RL is becoming a common approach to acquire a nonlinear optimal policy through trial and error. However, applying RL to real robot control is very difficult since it usually requires many learning trials. Such trials cannot be executed in real environments due to the limited durability of the real system. Therefore, in this study, instead of executing many learning trials, we use a recently developed RL algorithm called importance-weighted Policy Gradients with Parameter based Exploration (PGPE), with which the robot can efficiently reuse the previously sampled data to improve its policy parameters. We apply importance-weighted PGPE to CB-i, our real humanoid robot, and show that it can learn both target-reaching movement and cart-pole swing-up movements in a real environment within 10 minutes without any prior knowledge of the task or any carefully designed initial trajectory.
  • Keywords
    gradient methods; humanoid robots; learning (artificial intelligence); motion control; nonlinear control systems; optimal control; cart-pole swing-up movement; humanoid motor learning; humanoid robot; importance-weighted policy gradient; motor control; nonlinear optimal policy; parameter based exploration; reinforcement learning; target-reaching movement; Humanoid robots; Joints; Robot control; Standards; Trajectory; Virtual environments;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Humanoid Robots (Humanoids), 2014 14th IEEE-RAS International Conference on
  • Conference_Location
    Madrid
  • Type

    conf

  • DOI
    10.1109/HUMANOIDS.2014.7041417
  • Filename
    7041417