• DocumentCode
    3157970
  • Title

    Extension of Improved Penalty Avoiding Rational Policy Making algorithm to tile coding environment for keepaway tasks

  • Author

    WATanabe, Takuji ; Miyazaki, Koji ; Kobayashi, Hiroaki

  • Author_Institution
    Dept. of Sci. & Technol., Meiji Univ., Kawasaki
  • fYear
    2008
  • fDate
    20-22 Aug. 2008
  • Firstpage
    2039
  • Lastpage
    2044
  • Abstract
    We focus on potential capability of a profit sharing method (PS) in non-Markov multi-agent environments. It is shown that PS has some rationality in non-Markov environments and is also effective in multi-agent environments. However, conventional PS uses only a reward to learn suitable rules. On the other hand. ldquopenalty avoiding rational policy making algorithm (PARP)rdquo is based on PS and uses not only a reward but also penalties. PARP is improved to save memories and to cope with uncertainties, which is known as ldquoimproved penalty avoiding rational policy making algorithm (improved PARP).rdquo There is another critical problem we must cope with when we apply PS based methods to real environments; we need a huge amount of state information and most of states take continuous values. One solution for this problem is to approximate the states with a function approximation method, e.g. tile coding. In this paper, first, we extend improved penalty avoiding rational policy making algorithm to tile coding environments. Then, we compare the extended method with conventional methods to show the effectiveness through an application to a keepaway task in a soccer game.
  • Keywords
    control engineering computing; function approximation; incentive schemes; learning (artificial intelligence); mobile robots; multi-agent systems; multi-robot systems; Robocup; function approximation; improved penalty avoiding rational policy making algorithm; keepaway task; nonMarkov multiagent environment; profit sharing; reinforcement learning; robot soccer; soccer game; tile coding environment; Tiles; Improved PARP; PARP; keepaway; multiagent systems; profit Sharing; rainfocement learning; robot soccer;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SICE Annual Conference, 2008
  • Conference_Location
    Tokyo
  • Print_ISBN
    978-4-907764-30-2
  • Electronic_ISBN
    978-4-907764-29-6
  • Type

    conf

  • DOI
    10.1109/SICE.2008.4654997
  • Filename
    4654997