• DocumentCode
    2770897
  • Title

    Coordinating many agents in stochastic games

  • Author

    Bazzan, Ana L C

  • Author_Institution
    Inst. de Inf., Fed. Univ. of Rio Grande do Sul, Rio Grande, Brazil
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Learning in coordination games has been extensively studied in the game theory and multi-agent learning literature. Most of this work has considered a low number of agents and/or states (typically two agent, two action games). When the number of states and/or joint actions increases, standard approaches for multi-agent learning have difficulties coping with a high number of agents due to the combinatorial explosion in the number of joint actions and joint states. In real-world applications, this is a common setting though. This paper introduces a methodology for learning to coordinate in stochastic games with many agents. More specifically, we introduce a structure where some agents have knowledge about joint actions and how they have performed in the past. We empirically investigate this method for multi-agent learning in a typical stochastic game involving a high number of agents. Experimental results show that the additional information and structure is translated into earlier and higher levels of coordination and thus to higher payoffs.
  • Keywords
    combinatorial mathematics; game theory; learning (artificial intelligence); multi-agent systems; agent coordination; combinatorial explosion; coordination games; game theory; joint actions; joint states; multiagent learning literature; stochastic games; Convergence; Games; Joints; Learning; Monitoring; Silicon; Stochastic processes; Coordination in multiagent systems; Multiagent learning; Stochastic games;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2012 International Joint Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-1488-6
  • Electronic_ISBN
    2161-4393
  • Type

    conf

  • DOI
    10.1109/IJCNN.2012.6252457
  • Filename
    6252457