مرکز منطقه ای اطلاع رساني علوم و فناوري - Automatic reward shaping in Reinforcement Learning using graph analysis

DocumentCode :

2732176

Title :

Automatic reward shaping in Reinforcement Learning using graph analysis

Author :

Marashi, Maryam ; Khalilian, Alireza ; Shiri, Mohammad Ebrahim

Author_Institution :

Sch. of Math & Comput. Sci., Amirkabir Univ. of Technol., Tehran, Iran

fYear :

2012

fDate :

18-19 Oct. 2012

Firstpage :

111

Lastpage :

116

Abstract :

Reinforcement Learning is a popular context of machine learning that aims at improving the behavior of autonomous agents that learn from interactions with the environment. However, it is often costly, time consuming, and even dangerous. To deal with these problems, reward shaping has been used as a powerful method to accelerate the learning speed of the agent. The principle idea is to incorporate a numerical feedback, other than environment reward, for the learning agent. However, finding an efficient potential function to shape the reward is still an interesting area of research. In this paper, a new algorithm has been proposed that receives the environment graph, performs some new analysis, and provides the extracted information for the learning agent to accelerate the speed of learning. This information includes sub goals, bad states, and sub environments with different exploration, or reward, values. To evaluate this algorithm an experimental study has been conducted on two benchmark environments, Six Rooms and Maze. The obtained results demonstrate the effectiveness of the proposed algorithm.

Keywords :

graph theory; learning (artificial intelligence); multi-agent systems; agent learning speed; environment reward; graph analysis; machine learning; maze environment; numerical feedback; reinforcement learning; reward shaping; six rooms environment; Algorithm design and analysis; Convergence; Educational institutions; Heuristic algorithms; Learning; Machine learning; Markov processes; Artificial Feedback; Q-Learning; Reinforcement Learning; Reward Shaping;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer and Knowledge Engineering (ICCKE), 2012 2nd International eConference on

Conference_Location :

Mashhad

Print_ISBN :

978-1-4673-4475-3

Type :

conf

DOI :

10.1109/ICCKE.2012.6395362

Filename :

6395362

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2732176