How to explore your opponent´s strategy (almost) optimally

Author

Carmel, David ; Markovitch, Shad

Author_Institution

Dept. of Comput. Sci., Technion-Israel Inst. of Technol., Haifa, Israel

fYear

1998

fDate

3-7 Jul 1998

Firstpage

64

Lastpage

71

Abstract

Presents a lookahead-based exploration strategy for a model-based learning agent that enables exploration of the opponent´s behavior during interaction in a multi-agent system. Instead of holding one model, the model-based agent maintains a mixed opponent model, a distribution over a set of models that reflects its uncertainty about the opponent´s strategy. Every action is evaluated according to its long run contribution to the expected utility and to the knowledge regarding the opponent´s strategy. We present an efficient algorithm that returns an almost optimal exploration strategy against a given mixed model, and a learning method for acquiring a mixed model consistent with the opponent´s past behavior. We report experimental results in the Iterated Prisoner´s Dilemma game that demonstrate the superiority of the lookahead-based exploration strategy over other exploration methods

Keywords

game theory; learning automata; software agents; Iterated Prisoner´s Dilemma game; exploration strategy; lookahead; mixed opponent model; model-based learning agent; multi-agent system; Books; Computer science; Costs; Economic forecasting; History; Laboratories; Learning automata; Learning systems; Multiagent systems; Neural networks; Power generation economics; Predictive models; Uncertainty; Utility theory;

fLanguage

English

Publisher

ieee

Conference_Titel

Multi Agent Systems, 1998. Proceedings. International Conference on

Conference_Location

Paris

Print_ISBN

0-8186-8500-X

Type

conf

DOI

10.1109/ICMAS.1998.699033

Filename

699033