DocumentCode
1604623
Title
Phased Learning with Hierarchical Reinforcement Learning in Nonholonomic Motion Control
Author
Goto, Takaknuni ; Homma, Noriyasu ; Yoshizawa, Makoto ; Abe, Kenichi
Author_Institution
Dept. of Electr. & Commun. Eng., Tohoku Univ.
fYear
2006
Firstpage
4557
Lastpage
4562
Abstract
In this paper, a hierarchical reinforcement learning algorithm for controlling nonholonomic systems is proposed. When applying reinforcement learning to the nonholonomic systems, acquiring adequate policies is difficult because of an increase of learning steps and a convergence of local optimal policies. The proposed algorithm is inspired by such human learning behavior. Human can learn to control such systems sufficiently even if they initially have little knowledge about the system´s dynamics and the way to control. This human capability is suggested to be caused by their exploration strategies for acquiring the adequate policies. The key element of the proposed algorithm is a shaping function defined on a novel position-direction space. The shaping function is autonomously constructed once the goal is reached and constrains the exploration area to optimize the policy. The efficiency of the proposed shaping function was demonstrated by using a nonholonomic control problem of positioning the 2-link planer underactuated manipulator
Keywords
learning (artificial intelligence); manipulators; motion control; 2-link planer underactuated manipulator; human learning behavior; nonholonomic motion control; phased learning; reinforcement learning algorithm; shaping function; Communication system control; Computer science; Control systems; Convergence; Delay; Educational institutions; Humans; Learning; Medical control systems; Motion control; human learning behavior; nonholonomic systems; reinforcement learning; shaping function;
fLanguage
English
Publisher
ieee
Conference_Titel
SICE-ICASE, 2006. International Joint Conference
Conference_Location
Busan
Print_ISBN
89-950038-4-7
Electronic_ISBN
89-950038-5-5
Type
conf
DOI
10.1109/SICE.2006.315088
Filename
4108481
Link To Document