DocumentCode :
2045827
Title :
A study on use of prior information for acceleration of reinforcement learning
Author :
Terashima, Kento ; Murata, Junichi
Author_Institution :
Dept. of Electr. & Electron. Eng., Kyushu Univ., Fukuoka, Japan
fYear :
2011
fDate :
13-18 Sept. 2011
Firstpage :
537
Lastpage :
543
Abstract :
Reinforcement learning is a method with which an agent learns appropriate response for solving problems by trial-and-error. The advantage is that reinforcement learning can be applied to unknown or uncertain problems. But instead, there is a drawback that this method needs a long time to solve the problem because of trial-and-error. If there is prior information about the environment, some of trial-and-error can be spared and the learning can take a shorter time. The prior information provided by a human designer can be wrong because of uncertainties in the problems. If the wrong prior information is used, there can be bad effects such as failure to get the optimal policy and slowing down of reinforcement learning. We propose to control use of the prior information to suppress the bad effects. The agent forgets the prior information gradually by multiplying a forgetting factor while it learns the better policy. We apply the proposed method to a couple of testbed environments and a number of types of prior information. The method shows the good results in terms of both the learning speed and the quality of obtained policies.
Keywords :
learning (artificial intelligence); multi-agent systems; problem solving; uncertainty handling; agent learning; optimal policy; prior information; problem solving; reinforcement learning; trial-and-error; uncertainties; Acceleration; Educational institutions; Focusing; Humans; Learning; Learning systems; Trajectory; exploring visit; forgetting factor; option; prior information; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SICE Annual Conference (SICE), 2011 Proceedings of
Conference_Location :
Tokyo
ISSN :
pending
Print_ISBN :
978-1-4577-0714-8
Type :
conf
Filename :
6060724
Link To Document :
بازگشت