Extension of Improved Penalty Avoiding Rational Policy Making algorithm to tile coding environment for keepaway tasks

Author

WATanabe, Takuji ; Miyazaki, Koji ; Kobayashi, Hiroaki

Author_Institution

Dept. of Sci. & Technol., Meiji Univ., Kawasaki

fYear

2008

fDate

20-22 Aug. 2008

Firstpage

2039

Lastpage

2044

Abstract

We focus on potential capability of a profit sharing method (PS) in non-Markov multi-agent environments. It is shown that PS has some rationality in non-Markov environments and is also effective in multi-agent environments. However, conventional PS uses only a reward to learn suitable rules. On the other hand. ldquopenalty avoiding rational policy making algorithm (PARP)rdquo is based on PS and uses not only a reward but also penalties. PARP is improved to save memories and to cope with uncertainties, which is known as ldquoimproved penalty avoiding rational policy making algorithm (improved PARP).rdquo There is another critical problem we must cope with when we apply PS based methods to real environments; we need a huge amount of state information and most of states take continuous values. One solution for this problem is to approximate the states with a function approximation method, e.g. tile coding. In this paper, first, we extend improved penalty avoiding rational policy making algorithm to tile coding environments. Then, we compare the extended method with conventional methods to show the effectiveness through an application to a keepaway task in a soccer game.

Keywords

control engineering computing; function approximation; incentive schemes; learning (artificial intelligence); mobile robots; multi-agent systems; multi-robot systems; Robocup; function approximation; improved penalty avoiding rational policy making algorithm; keepaway task; nonMarkov multiagent environment; profit sharing; reinforcement learning; robot soccer; soccer game; tile coding environment; Tiles; Improved PARP; PARP; keepaway; multiagent systems; profit Sharing; rainfocement learning; robot soccer;

fLanguage

English

Publisher

ieee

Conference_Titel

SICE Annual Conference, 2008

Conference_Location

Tokyo

Print_ISBN

978-4-907764-30-2

Electronic_ISBN

978-4-907764-29-6

Type

conf

DOI

10.1109/SICE.2008.4654997

Filename

4654997