DocumentCode :
2597548
Title :
On-line learning of a feedback controller for quasi-passive-dynamic walking by a stochastic policy gradient method
Author :
Hitomi, Kentarou ; Shibata, Tomohiro ; Nakamura, Yutaka ; Ishii, Shin
Author_Institution :
Nara Inst. of Sci. & Technol., Japan
fYear :
2005
fDate :
2-6 Aug. 2005
Firstpage :
3803
Lastpage :
3808
Abstract :
A class of biped locomotion called passive dynamic walking (PDW) has been recognized to be efficient in energy consumption and a key to understand human walking. Although PDW is sensitive to the initial condition and disturbances, some studies of quasi-PDW, which introduces supplementary actuators, are reported to overcome the sensitivity. In this article, for realization of the quasi-PDW, an on-line learning scheme of a feedback controller based on a policy gradient reinforcement learning method is proposed. Computer simulations show that the parameter in a quasi-PDW controller is automatically tuned by our method utilizing the passivity of the robot dynamics. The obtained controller is robust against variations in the slope gradient to some extent.
Keywords :
adaptive control; control engineering computing; feedback; gradient methods; learning (artificial intelligence); legged locomotion; motion control; robot dynamics; stochastic processes; adaptive control; biped locomotion; feedback controller; online learning; policy gradient reinforcement learning; quasipassive-dynamic walking; robot dynamics; stochastic policy gradient method; Actuators; Adaptive control; Automatic control; Computer simulation; Energy consumption; Gradient methods; Humans; Learning; Legged locomotion; Stochastic processes; 2D biped; adaptive control; passive dynamic walk; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2005. (IROS 2005). 2005 IEEE/RSJ International Conference on
Print_ISBN :
0-7803-8912-3
Type :
conf
DOI :
10.1109/IROS.2005.1545258
Filename :
1545258
Link To Document :
بازگشت