• DocumentCode
    2616647
  • Title

    Phase-dependent trajectory optimization for CPG-based biped walking using path integral reinforcement learning

  • Author

    Sugimoto, Norikazu ; Morimoto, J. Un

  • Author_Institution
    Nat. Inst. of Inf. & Commun. Technol], Adv. ICT Res. Inst., Kyoto, Japan
  • fYear
    2011
  • fDate
    26-28 Oct. 2011
  • Firstpage
    255
  • Lastpage
    260
  • Abstract
    In this study, we introduce a phase-dependent trajectory optimization method for Central Pattern Generator (CPG)-based biped walking controllers. By exploiting the synchronization property of the CPG controller, many legged locomotion studies have shown that the CPG-based walking controller is robust against external perturbations and works well in real environments. However, due to the nonlinear dynamic property of the coupled oscillator system composed of the CPG controller and the robot, analytically designing the biped trajectory to satisfy the requirements of a target walking pattern is rather difficult. Therefore, using a nonlinear optimization method is reasonable to improve the walking trajectory. To optimize the walking trajectory, a model-free optimal control method is preferable because precise modeling of the ground contact is difficult. On the other hand, model-free trajectory optimization methods have been considered as quite computationally demanding approach. However, because of recent advances in the nonlinear trajectory optimization method, using the model-free optimization method is now a realistic approach for biped trajectory optimization. We use a path integral reinforcement learning method to improve the biped walking trajectory for CPG-based walking controllers.
  • Keywords
    learning (artificial intelligence); legged locomotion; optimal control; optimisation; oscillators; trajectory control; CPG controller; CPG-based biped walking; central pattern generator; coupled oscillator system; legged locomotion; model-free optimal control method; nonlinear dynamic property; nonlinear optimization method; path integral reinforcement learning; phase-dependent trajectory optimization; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Humanoid Robots (Humanoids), 2011 11th IEEE-RAS International Conference on
  • Conference_Location
    Bled
  • ISSN
    2164-0572
  • Print_ISBN
    978-1-61284-866-2
  • Electronic_ISBN
    2164-0572
  • Type

    conf

  • DOI
    10.1109/Humanoids.2011.6100908
  • Filename
    6100908