مرکز منطقه ای اطلاع رساني علوم و فناوري - Phase-dependent trajectory optimization for CPG-based biped walking using path integral reinforcement learning

DocumentCode :

2616647

Title :

Phase-dependent trajectory optimization for CPG-based biped walking using path integral reinforcement learning

Author :

Sugimoto, Norikazu ; Morimoto, J. Un

Author_Institution :

Nat. Inst. of Inf. & Commun. Technol], Adv. ICT Res. Inst., Kyoto, Japan

fYear :

2011

fDate :

26-28 Oct. 2011

Firstpage :

255

Lastpage :

260

Abstract :

In this study, we introduce a phase-dependent trajectory optimization method for Central Pattern Generator (CPG)-based biped walking controllers. By exploiting the synchronization property of the CPG controller, many legged locomotion studies have shown that the CPG-based walking controller is robust against external perturbations and works well in real environments. However, due to the nonlinear dynamic property of the coupled oscillator system composed of the CPG controller and the robot, analytically designing the biped trajectory to satisfy the requirements of a target walking pattern is rather difficult. Therefore, using a nonlinear optimization method is reasonable to improve the walking trajectory. To optimize the walking trajectory, a model-free optimal control method is preferable because precise modeling of the ground contact is difficult. On the other hand, model-free trajectory optimization methods have been considered as quite computationally demanding approach. However, because of recent advances in the nonlinear trajectory optimization method, using the model-free optimization method is now a realistic approach for biped trajectory optimization. We use a path integral reinforcement learning method to improve the biped walking trajectory for CPG-based walking controllers.

Keywords :

learning (artificial intelligence); legged locomotion; optimal control; optimisation; oscillators; trajectory control; CPG controller; CPG-based biped walking; central pattern generator; coupled oscillator system; legged locomotion; model-free optimal control method; nonlinear dynamic property; nonlinear optimization method; path integral reinforcement learning; phase-dependent trajectory optimization; Robots;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Humanoid Robots (Humanoids), 2011 11th IEEE-RAS International Conference on

Conference_Location :

Bled

ISSN :

2164-0572

Print_ISBN :

978-1-61284-866-2

Electronic_ISBN :

2164-0572

Type :

conf

DOI :

10.1109/Humanoids.2011.6100908

Filename :

6100908

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2616647