مرکز منطقه ای اطلاع رساني علوم و فناوري - Evolutionary approach of reward function for reinforcement learning using genetic programming

DocumentCode :

2632348

Title :

Evolutionary approach of reward function for reinforcement learning using genetic programming

Author :

Sumino, Shota ; Mutoh, Atsuko ; Kato, Shohei

Author_Institution :

Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan

fYear :

2011

fDate :

6-9 Nov. 2011

Firstpage :

385

Lastpage :

390

Abstract :

In recent year, reinforcement learning, which acquires a behavior of robots has been drawing attention. A suitable behavior is autonomously acquired by using this system. Robots learn the suitable behavior by iterating action and receiving the evaluated value of that action. The evaluated value is calculated by reward function. In general reinforcement learning, we acquire a suitable behavior by setting the suitable reward function for each problem. However in previous research of reinforcement learning, most reward functions are constructed based on human´s heuristics. To construct reward functions, trial-and-error is needed, and it imposes an enormous drain on humans. Therefore we propose an approach, which automatically generate reward functions, using Genetic Programming. In this approach, we create a method evaluating reward functions. Reward functions are generated by Genetic Programming, and are evaluated by evaluating method. A suitable reward function is generated by evolution of these reward functions. In this paper, we conducted an experiment to confirm the effectiveness of proposed method. In the experiment, we generate a suitable reward function of a problem, which a route searching problem in a tile-world. Through the experiment, we confirm that the proposed approach can generate a suitable reward function, and the generated reward function can acquire a more suitable behavior in comparison with a reward function by constructed based on human´s heuristics.

Keywords :

genetic algorithms; learning systems; mobile robots; search problems; evolutionary approach; general reinforcement learning; genetic programming; human heuristics; reward function; robot behavior; route searching problem; trial-and-error; Genetics; Robots;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Micro-NanoMechatronics and Human Science (MHS), 2011 International Symposium on

Conference_Location :

Nagoya

ISSN :

Pending

Print_ISBN :

978-1-4577-1360-6

Type :

conf

DOI :

10.1109/MHS.2011.6102214

Filename :

6102214

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2632348