Reinforcement learning approach to locomotion adaptation in sloped environments

Author

Andre, Joao ; Costa, Luis ; Santos, Cristina

Author_Institution

Dept. of Ind. Electron., Univ. of Minho, Braga, Portugal

fYear

2014

fDate

14-15 May 2014

Firstpage

164

Lastpage

169

Abstract

In this work, Path Integral Policy Improvement with Covariance Matrix Adaptation (PI²-CMA) is implemented and used to address biped locomotion optimization. Central Pattern Generators (CPGs) and Dynamic Movement Primitives (DMPs) are combined to easily produce complex joint trajectories for simulated DARwIn-OP humanoid robot. PI²-CMA seeks optimal DMPs´ weights that maximize frontal velocity when facing different challenges. The simulation environments demand adaptation from the controller in order to successfully walk in different slopes. Elitism was introduced in PI²-CMA in order to improve the convergence property of the algorithm. Results show that these approaches enabled easy adaptation of DARwIn-OP to new situations. The results are very promising and demonstrate the flexibility at generating new trajectories for locomotion.

Keywords

covariance matrices; humanoid robots; learning (artificial intelligence); legged locomotion; motion control; trajectory control; CPG; DARwIn-OP humanoid robot; DMP; PI²-CMA; biped locomotion optimization; central pattern generators; dynamic movement primitives; locomotion adaptation; path integral policy improvement with covariance matrix adaptation; reinforcement learning approach; sloped environment; trajectory generation; Convergence; Generators; Hip; Joints; Legged locomotion; Trajectory;

fLanguage

English

Publisher

ieee

Conference_Titel

Autonomous Robot Systems and Competitions (ICARSC), 2014 IEEE International Conference on

Conference_Location

Espinho

Type

conf

DOI

10.1109/ICARSC.2014.6849780

Filename

6849780