• DocumentCode
    172868
  • Title

    Reinforcement learning approach to locomotion adaptation in sloped environments

  • Author

    Andre, Joao ; Costa, Luis ; Santos, Cristina

  • Author_Institution
    Dept. of Ind. Electron., Univ. of Minho, Braga, Portugal
  • fYear
    2014
  • fDate
    14-15 May 2014
  • Firstpage
    164
  • Lastpage
    169
  • Abstract
    In this work, Path Integral Policy Improvement with Covariance Matrix Adaptation (PI2-CMA) is implemented and used to address biped locomotion optimization. Central Pattern Generators (CPGs) and Dynamic Movement Primitives (DMPs) are combined to easily produce complex joint trajectories for simulated DARwIn-OP humanoid robot. PI2-CMA seeks optimal DMPs´ weights that maximize frontal velocity when facing different challenges. The simulation environments demand adaptation from the controller in order to successfully walk in different slopes. Elitism was introduced in PI2-CMA in order to improve the convergence property of the algorithm. Results show that these approaches enabled easy adaptation of DARwIn-OP to new situations. The results are very promising and demonstrate the flexibility at generating new trajectories for locomotion.
  • Keywords
    covariance matrices; humanoid robots; learning (artificial intelligence); legged locomotion; motion control; trajectory control; CPG; DARwIn-OP humanoid robot; DMP; PI2-CMA; biped locomotion optimization; central pattern generators; dynamic movement primitives; locomotion adaptation; path integral policy improvement with covariance matrix adaptation; reinforcement learning approach; sloped environment; trajectory generation; Convergence; Generators; Hip; Joints; Legged locomotion; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Autonomous Robot Systems and Competitions (ICARSC), 2014 IEEE International Conference on
  • Conference_Location
    Espinho
  • Type

    conf

  • DOI
    10.1109/ICARSC.2014.6849780
  • Filename
    6849780