DocumentCode
2341799
Title
Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression
Author
Morimoto, Jun ; Atkeson, Christopher G. ; Endo, Gen ; Cheng, Gordon
Author_Institution
Japan Sci. & Technol. Agency, Tokyo
fYear
2007
fDate
Oct. 29 2007-Nov. 2 2007
Firstpage
4234
Lastpage
4240
Abstract
We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although RL is a useful non-linear optimizer, it is usually difficult to apply RL to real robotic systems - due to the large number of iterations required to acquire suitable policies. In this study, we first approximated the dynamics by using data from a real robot, and then applied the estimated dynamics in RL in order to improve stepping and walking policies. Gaussian processes were used to approximate the dynamics. By using Gaussian processes, we could estimate a probability distribution of a target function with a given covariance function. Thus, RL can take the uncertainty of the approximated dynamics into account throughout the learning process. We show that we can improve stepping and walking policies by using a RL method with the approximated models both in simulated and real environments. Experimental validation on a real humanoid robot of the proposed
Keywords
Gaussian processes; covariance analysis; humanoid robots; learning (artificial intelligence); legged locomotion; regression analysis; statistical distributions; Gaussian process; biped stepping; covariance function; humanoid locomotive; humanoid robot; learnt approximated dynamics; probability distribution; regression; reinforcement learning; walking dynamics; Aerodynamics; Gaussian processes; Humanoid robots; Intelligent robots; Laboratories; Learning; Legged locomotion; Probability distribution; USA Councils; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International Conference on
Conference_Location
San Diego, CA
Print_ISBN
978-1-4244-0912-9
Electronic_ISBN
978-1-4244-0912-9
Type
conf
DOI
10.1109/IROS.2007.4399485
Filename
4399485
Link To Document