DocumentCode
3256622
Title
Distributing rewards by strategic knowledge based on Nash-Q learning
Author
Igoshi, Kazuo ; Miura, Takao ; Shioya, Isamu
Author_Institution
Dept.of Electr. & Electr. Eng., Hosei Univ., Koganei
fYear
2008
fDate
4-6 Aug. 2008
Firstpage
458
Lastpage
463
Abstract
In this investigation, we examine collaboration approach to reward distribution in repeated general-sum stochastic games by multiple game players in terms of position and rewards. There have been several investigation of reward distribution discussed so far, and reinforcement has been considered useful since no knowledge is needed in advanced and better decision can be extracted while learning. Among others, Q-learning has been paid much attention under single agent environment. However, under multi-agent environment, we donpsilat have sharp targets to this problem, what is the most optimal principle? In this work, we discuss how to distribute reward thoroughly by considering as general stochastic games based on theory of games. That is, we introduce Nash-Q approach which combines Nash equilibrium with Q-learning. We show the new approach provides us with new strategic solution. We discuss some experiments of rather complicated games (game of life) to see the usefulness of the approach.
Keywords
learning (artificial intelligence); stochastic games; Nash equilibrium; Nash-Q learning; repeated general-sum stochastic games; reward distribution; strategic knowledge; Collaboration; Game theory; Nash equilibrium; Stochastic processes;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Digital Information and Web Technologies, 2008. ICADIWT 2008. First International Conference on the
Conference_Location
Ostrava
Print_ISBN
978-1-4244-2623-2
Electronic_ISBN
978-1-4244-2624-9
Type
conf
DOI
10.1109/ICADIWT.2008.4664393
Filename
4664393
Link To Document