DocumentCode
2826476
Title
How to explore your opponent´s strategy (almost) optimally
Author
Carmel, David ; Markovitch, Shad
Author_Institution
Dept. of Comput. Sci., Technion-Israel Inst. of Technol., Haifa, Israel
fYear
1998
fDate
3-7 Jul 1998
Firstpage
64
Lastpage
71
Abstract
Presents a lookahead-based exploration strategy for a model-based learning agent that enables exploration of the opponent´s behavior during interaction in a multi-agent system. Instead of holding one model, the model-based agent maintains a mixed opponent model, a distribution over a set of models that reflects its uncertainty about the opponent´s strategy. Every action is evaluated according to its long run contribution to the expected utility and to the knowledge regarding the opponent´s strategy. We present an efficient algorithm that returns an almost optimal exploration strategy against a given mixed model, and a learning method for acquiring a mixed model consistent with the opponent´s past behavior. We report experimental results in the Iterated Prisoner´s Dilemma game that demonstrate the superiority of the lookahead-based exploration strategy over other exploration methods
Keywords
game theory; learning automata; software agents; Iterated Prisoner´s Dilemma game; exploration strategy; lookahead; mixed opponent model; model-based learning agent; multi-agent system; Books; Computer science; Costs; Economic forecasting; History; Laboratories; Learning automata; Learning systems; Multiagent systems; Neural networks; Power generation economics; Predictive models; Uncertainty; Utility theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Multi Agent Systems, 1998. Proceedings. International Conference on
Conference_Location
Paris
Print_ISBN
0-8186-8500-X
Type
conf
DOI
10.1109/ICMAS.1998.699033
Filename
699033
Link To Document