DocumentCode
172868
Title
Reinforcement learning approach to locomotion adaptation in sloped environments
Author
Andre, Joao ; Costa, Luis ; Santos, Cristina
Author_Institution
Dept. of Ind. Electron., Univ. of Minho, Braga, Portugal
fYear
2014
fDate
14-15 May 2014
Firstpage
164
Lastpage
169
Abstract
In this work, Path Integral Policy Improvement with Covariance Matrix Adaptation (PI2-CMA) is implemented and used to address biped locomotion optimization. Central Pattern Generators (CPGs) and Dynamic Movement Primitives (DMPs) are combined to easily produce complex joint trajectories for simulated DARwIn-OP humanoid robot. PI2-CMA seeks optimal DMPs´ weights that maximize frontal velocity when facing different challenges. The simulation environments demand adaptation from the controller in order to successfully walk in different slopes. Elitism was introduced in PI2-CMA in order to improve the convergence property of the algorithm. Results show that these approaches enabled easy adaptation of DARwIn-OP to new situations. The results are very promising and demonstrate the flexibility at generating new trajectories for locomotion.
Keywords
covariance matrices; humanoid robots; learning (artificial intelligence); legged locomotion; motion control; trajectory control; CPG; DARwIn-OP humanoid robot; DMP; PI2-CMA; biped locomotion optimization; central pattern generators; dynamic movement primitives; locomotion adaptation; path integral policy improvement with covariance matrix adaptation; reinforcement learning approach; sloped environment; trajectory generation; Convergence; Generators; Hip; Joints; Legged locomotion; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Autonomous Robot Systems and Competitions (ICARSC), 2014 IEEE International Conference on
Conference_Location
Espinho
Type
conf
DOI
10.1109/ICARSC.2014.6849780
Filename
6849780
Link To Document