DocumentCode
1684624
Title
Understanding goals in learning by interaction
Author
Batalov, Denis V.
Author_Institution
Carleton Univ., Ottawa, Ont., Canada
Volume
2
fYear
2002
fDate
2002
Firstpage
1510
Lastpage
1515
Abstract
In reinforcement learning (RL), the goal of an agent is to maximize the sum of rewards that it receives. Agent designers must translate the desired goal into a particular reinforcement function. This translation is inherently error-prone because it is performed manually by human experimenters guided only by their experience and certain heuristic rules. It is a well-known phenomenon in RL that an agent finds a way of maximizing the return without actually reaching the intended goal, all because of an incorrectly specified reinforcement function. For example, if in a game of chess we reward the taking of an opponent's piece, the agent might find it more profitable to take as many pieces as possible at the expense of losing the game. In this paper, we first examine the notion of a goal and then, based on our understanding of goals, propose a generalized way of imparting goal information to agents as an alternative to reinforcements. We show that this approach can significantly simplify goal specification, making it less prone to errors, and in many cases reduce the memory requirements of learning algorithms. Preliminary experimental results with modified Q-learning algorithms are also reported.
Keywords
feedback; heuristic programming; learning (artificial intelligence); heuristic rules; learning algorithms; learning by interaction; modified Q-learning algorithms; reinforcement learning; Algorithm design and analysis; Animals; Feedback loop; Humans; Learning automata; Machine learning; Machine learning algorithms; Psychology;
fLanguage
English
Publisher
IEEE
Conference_Titel
Neural Networks, 2002. IJCNN '02. Proceedings of the 2002 International Joint Conference on
Conference_Location
Honolulu, HI
ISSN
1098-7576
Print_ISBN
0-7803-7278-6
Type
conf
DOI
10.1109/IJCNN.2002.1007741
Filename
1007741