DocumentCode :
2632348
Title :
Evolutionary approach of reward function for reinforcement learning using genetic programming
Author :
Sumino, Shota ; Mutoh, Atsuko ; Kato, Shohei
Author_Institution :
Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan
fYear :
2011
fDate :
6-9 Nov. 2011
Firstpage :
385
Lastpage :
390
Abstract :
In recent year, reinforcement learning, which acquires a behavior of robots has been drawing attention. A suitable behavior is autonomously acquired by using this system. Robots learn the suitable behavior by iterating action and receiving the evaluated value of that action. The evaluated value is calculated by reward function. In general reinforcement learning, we acquire a suitable behavior by setting the suitable reward function for each problem. However in previous research of reinforcement learning, most reward functions are constructed based on human´s heuristics. To construct reward functions, trial-and-error is needed, and it imposes an enormous drain on humans. Therefore we propose an approach, which automatically generate reward functions, using Genetic Programming. In this approach, we create a method evaluating reward functions. Reward functions are generated by Genetic Programming, and are evaluated by evaluating method. A suitable reward function is generated by evolution of these reward functions. In this paper, we conducted an experiment to confirm the effectiveness of proposed method. In the experiment, we generate a suitable reward function of a problem, which a route searching problem in a tile-world. Through the experiment, we confirm that the proposed approach can generate a suitable reward function, and the generated reward function can acquire a more suitable behavior in comparison with a reward function by constructed based on human´s heuristics.
Keywords :
genetic algorithms; learning systems; mobile robots; search problems; evolutionary approach; general reinforcement learning; genetic programming; human heuristics; reward function; robot behavior; route searching problem; trial-and-error; Genetics; Robots;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Micro-NanoMechatronics and Human Science (MHS), 2011 International Symposium on
Conference_Location :
Nagoya
ISSN :
Pending
Print_ISBN :
978-1-4577-1360-6
Type :
conf
DOI :
10.1109/MHS.2011.6102214
Filename :
6102214
Link To Document :
بازگشت