DocumentCode
3563672
Title
Tractable infinite order Markov analysis for iterated games with learners
Author
Hidaka, Shohei ; Torii, Takuma ; Masumi, Akira
Author_Institution
Sch. of Knowledge Sci., Japan Adv. Inst. of Sci. & Technol., Nomi, Japan
fYear
2014
Firstpage
286
Lastpage
291
Abstract
The theory of games involving players who adaptively learn from their past experiences is not yet well understood. We analyze games in which players make on each turn a probabilistic choice of actions determined by a kth-order Markov process which signifies how they learn from their past k actions for a fixed number k. As the number of states in such Markov processes grows exponentially with k, the analysis of games involving learners with long memories has been viewed as computationally intractable. This study develops a technique which enables feasible analysis of these long-memory Markov process. We further show that, for two players involved in an iterated prisoners´ dilemma, the probability of mutual defection increases with the size of their memories. This result is consistent with the classical prisoners´ dilemma with two rational players.
Keywords
Markov processes; game theory; learning (artificial intelligence); iterated games; kth-order Markov process; long-memory Markov process; probabilistic choice; tractable infinite order Markov analysis; Games; Learning (artificial intelligence); Markov processes; Matrix decomposition; Nash equilibrium; Probabilistic logic; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Soft Computing and Intelligent Systems (SCIS), 2014 Joint 7th International Conference on and Advanced Intelligent Systems (ISIS), 15th International Symposium on
Type
conf
DOI
10.1109/SCIS-ISIS.2014.7044671
Filename
7044671
Link To Document