• DocumentCode
    3256622
  • Title

    Distributing rewards by strategic knowledge based on Nash-Q learning

  • Author

    Igoshi, Kazuo ; Miura, Takao ; Shioya, Isamu

  • Author_Institution
    Dept.of Electr. & Electr. Eng., Hosei Univ., Koganei
  • fYear
    2008
  • fDate
    4-6 Aug. 2008
  • Firstpage
    458
  • Lastpage
    463
  • Abstract
    In this investigation, we examine collaboration approach to reward distribution in repeated general-sum stochastic games by multiple game players in terms of position and rewards. There have been several investigation of reward distribution discussed so far, and reinforcement has been considered useful since no knowledge is needed in advanced and better decision can be extracted while learning. Among others, Q-learning has been paid much attention under single agent environment. However, under multi-agent environment, we donpsilat have sharp targets to this problem, what is the most optimal principle? In this work, we discuss how to distribute reward thoroughly by considering as general stochastic games based on theory of games. That is, we introduce Nash-Q approach which combines Nash equilibrium with Q-learning. We show the new approach provides us with new strategic solution. We discuss some experiments of rather complicated games (game of life) to see the usefulness of the approach.
  • Keywords
    learning (artificial intelligence); stochastic games; Nash equilibrium; Nash-Q learning; repeated general-sum stochastic games; reward distribution; strategic knowledge; Collaboration; Game theory; Nash equilibrium; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Digital Information and Web Technologies, 2008. ICADIWT 2008. First International Conference on the
  • Conference_Location
    Ostrava
  • Print_ISBN
    978-1-4244-2623-2
  • Electronic_ISBN
    978-1-4244-2624-9
  • Type

    conf

  • DOI
    10.1109/ICADIWT.2008.4664393
  • Filename
    4664393