DocumentCode :
1450868
Title :
Fusion of Multiple Behaviors Using Layered Reinforcement Learning
Author :
Hwang, Kao-Shing ; Chen, Yu-Jen ; Wu, Chun-Ju
Author_Institution :
Electr. Eng. Dept., Nat. Sun Yat-sen Univ., Kaohsiung, Taiwan
Volume :
42
Issue :
4
fYear :
2012
fDate :
7/1/2012 12:00:00 AM
Firstpage :
999
Lastpage :
1004
Abstract :
This study introduces a method to enable a robot to learn how to perform new tasks through human demonstration and independent practice. The proposed process consists of two interconnected phases; in the first phase, state-action data are obtained from human demonstrations, and an aggregated state space is learned in terms of a decision tree that groups similar states together through reinforcement learning. Without the postprocess of trimming, in tree induction, the tree encodes a control policy that can be used to control the robot by means of repeatedly improving itself. Once a variety of behaviors is learned, more elaborate behaviors can be generated by selectively organizing several behaviors using another Q-learning algorithm. The composed outputs of the organized basic behaviors on the motor level are weighted using the policy learned through Q-learning. This approach uses three diverse Q-learning algorithms to learn complex behaviors. The experimental results show that the learned complicated behaviors, organized according to individual basic behaviors by the three Q-learning algorithms on different levels, can function more adaptively in a dynamic environment.
Keywords :
decision trees; intelligent robots; learning by example; state-space methods; Q-learning algorithm; aggregated state space learning; complex behavior learning; control policy; decision tree; human demonstrations; layered reinforcement learning; learning robot; motor level; multiple behavior fusion; similar state grouping; state-action data; tree induction; Aerospace electronics; Bismuth; Decision trees; Humans; Robot kinematics; Robot sensing systems; Behavior-based control; intelligent robots; reinforcement learning;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4427
Type :
jour
DOI :
10.1109/TSMCA.2012.2183349
Filename :
6153389
Link To Document :
بازگشت