DocumentCode :
3622758
Title :
Reinforcement Learning Control for Biped Robot Walking on Uneven Surfaces
Author :
Shouyi Wang;J. Braaksma;R. Babuska;D. Hobbelen
Author_Institution :
Delft Center for Systems and Control, Delft University of Technology, Mekelweg 2, 2628 CD Delft, the Netherlands
fYear :
2006
fDate :
2006
Firstpage :
4173
Lastpage :
4178
Abstract :
Biped robots based on the concept of (passive) dynamic walking are far simpler than traditional fully controlled walking robots, while achieving a more natural gait and consuming less energy. However, lightly actuated dynamic walking robots, which rely on the natural limit cycle of their mechanical structure, are very sensitive to ground disturbances. Even a very small step down can cause the robot to lose stability. In this paper, we investigate the use of reinforcement learning to make a dynamic walking robot more robust against ground disturbances. The learning controller is applied to a simulated two-link biped, which is an abstraction of a mechanical prototype developed at the Delft Biorobotics Laboratory. The learning controller has been designed such that it can be applied as a straightforward extension of the proportional-derivative (PD) controller currently used to drive the robot's pneumatic actuators. The learning controller is therefore suitable for future implementation in the robot hardware. Simulation results demonstrate that the biped quickly learns to overcome step-down disturbances on the floor of up to 10% of the leg length, without compromising the natural walking style provided by the PD controller, which was optimized for walking on an even surface.
Keywords :
"Legged locomotion","Learning","Robot control","Robot sensing systems","Proportional control","PD control","Limit-cycles","Stability","Robustness","Virtual prototyping"
Publisher :
IEEE
Conference_Titel :
Neural Networks, 2006. IJCNN '06. International Joint Conference on
ISSN :
2161-4393
Print_ISBN :
0-7803-9490-9
Electronic_ISBN :
2161-4407
Type :
conf
DOI :
10.1109/IJCNN.2006.246966
Filename :
1716675