• DocumentCode
    2826476
  • Title

    How to explore your opponent´s strategy (almost) optimally

  • Author

    Carmel, David ; Markovitch, Shad

  • Author_Institution
    Dept. of Comput. Sci., Technion-Israel Inst. of Technol., Haifa, Israel
  • fYear
    1998
  • fDate
    3-7 Jul 1998
  • Firstpage
    64
  • Lastpage
    71
  • Abstract
    Presents a lookahead-based exploration strategy for a model-based learning agent that enables exploration of the opponent´s behavior during interaction in a multi-agent system. Instead of holding one model, the model-based agent maintains a mixed opponent model, a distribution over a set of models that reflects its uncertainty about the opponent´s strategy. Every action is evaluated according to its long run contribution to the expected utility and to the knowledge regarding the opponent´s strategy. We present an efficient algorithm that returns an almost optimal exploration strategy against a given mixed model, and a learning method for acquiring a mixed model consistent with the opponent´s past behavior. We report experimental results in the Iterated Prisoner´s Dilemma game that demonstrate the superiority of the lookahead-based exploration strategy over other exploration methods
  • Keywords
    game theory; learning automata; software agents; Iterated Prisoner´s Dilemma game; exploration strategy; lookahead; mixed opponent model; model-based learning agent; multi-agent system; Books; Computer science; Costs; Economic forecasting; History; Laboratories; Learning automata; Learning systems; Multiagent systems; Neural networks; Power generation economics; Predictive models; Uncertainty; Utility theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multi Agent Systems, 1998. Proceedings. International Conference on
  • Conference_Location
    Paris
  • Print_ISBN
    0-8186-8500-X
  • Type

    conf

  • DOI
    10.1109/ICMAS.1998.699033
  • Filename
    699033