Distributing rewards by strategic knowledge based on Nash-Q learning

Author

Igoshi, Kazuo ; Miura, Takao ; Shioya, Isamu

Author_Institution

Dept.of Electr. & Electr. Eng., Hosei Univ., Koganei

fYear

2008

fDate

4-6 Aug. 2008

Firstpage

458

Lastpage

463

Abstract

In this investigation, we examine collaboration approach to reward distribution in repeated general-sum stochastic games by multiple game players in terms of position and rewards. There have been several investigation of reward distribution discussed so far, and reinforcement has been considered useful since no knowledge is needed in advanced and better decision can be extracted while learning. Among others, Q-learning has been paid much attention under single agent environment. However, under multi-agent environment, we donpsilat have sharp targets to this problem, what is the most optimal principle? In this work, we discuss how to distribute reward thoroughly by considering as general stochastic games based on theory of games. That is, we introduce Nash-Q approach which combines Nash equilibrium with Q-learning. We show the new approach provides us with new strategic solution. We discuss some experiments of rather complicated games (game of life) to see the usefulness of the approach.

Keywords

learning (artificial intelligence); stochastic games; Nash equilibrium; Nash-Q learning; repeated general-sum stochastic games; reward distribution; strategic knowledge; Collaboration; Game theory; Nash equilibrium; Stochastic processes;

fLanguage

English

Publisher

ieee

Conference_Titel

Applications of Digital Information and Web Technologies, 2008. ICADIWT 2008. First International Conference on the

Conference_Location

Ostrava

Print_ISBN

978-1-4244-2623-2

Electronic_ISBN

978-1-4244-2624-9

Type

conf

DOI

10.1109/ICADIWT.2008.4664393

Filename

4664393