DocumentCode :
2682093
Title :
Consideration on robotic giant-swing motion generated by reinforcement learning
Author :
Hara, M. ; Kawabe, N. ; Sakai, N. ; Huang, J. ; Bleuler, Hannes ; Yabuta, T.
Author_Institution :
Ecole Polytech. Fed. de Lausanne, Lausanne, Switzerland
fYear :
2009
fDate :
10-15 Oct. 2009
Firstpage :
4206
Lastpage :
4211
Abstract :
This study attempts to make a compact humanoid robot acquire a giant-swing motion without any robotic models by using reinforcement learning; only the interaction with environment is available. Generally, it is widely said that this type of learning method is not appropriated to obtain dynamic motions because Markov property is not necessarily guaranteed during the dynamic task. However, in this study, we try to avoid this problem by embedding the dynamic information in the robotic state space; the applicability of the proposed method is considered using both the real robot and dynamic simulator. This paper, in particular, discusses how the robot with 5-DOF, in which the Q-Learning algorithm is implemented, acquires a giant-swing motion. Further, we describe the reward effects on the Q-Learning. Finally, this paper demonstrates that the application of the Q-Learning enable the robot to perform a very attractive giant-swing motion.
Keywords :
algorithm theory; learning (artificial intelligence); motion control; real-time systems; robots; 5DOF robot; Markov property; Q learning algorithm; compact humanoid robot; dynamic simulator; embedding dynamic iniformation; giant swing motion; obtain dynamic motions; reinforcement learning; robotic giant swing motion; robotic state space; Human robot interaction; Humanoid robots; Intelligent robots; Learning systems; Legged locomotion; Mobile robots; Orbital robotics; Robot motion; Supervised learning; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2009. IROS 2009. IEEE/RSJ International Conference on
Conference_Location :
St. Louis, MO
Print_ISBN :
978-1-4244-3803-7
Electronic_ISBN :
978-1-4244-3804-4
Type :
conf
DOI :
10.1109/IROS.2009.5354270
Filename :
5354270
Link To Document :
بازگشت