• DocumentCode
    3528136
  • Title

    A distributed learning algorithm with bit-valued communications for multi-agent welfare optimization

  • Author

    Menon, Ashok ; Baras, John S.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Maryland, College Park, MD, USA
  • fYear
    2013
  • fDate
    10-13 Dec. 2013
  • Firstpage
    2406
  • Lastpage
    2411
  • Abstract
    A multi-agent system comprising N agents, each picking actions from a finite set and receiving a payoff that depends on the action of the whole, is considered. The exact form of the payoffs are unknown and only their values can be measured by the respective agents. A decentralized algorithm was proposed by Marden et al. [1] and in the authors´ earlier work [2] that, in this setting, leads to the agents picking welfare optimizing actions under some restrictive assumptions on the payoff structure. This algorithm is modified in this paper to incorporate exchange of certain bit-valued information between the agents over a directed communication graph. The notion of an interaction graph is then introduced to encode known interaction in the system. Restrictions on the payoff structure are eliminated and conditions that guarantee convergence to welfare minimizing actions w.p. 1 are derived under the assumption that the union of the interaction graph and communication graph is strongly connected.
  • Keywords
    directed graphs; learning (artificial intelligence); multi-agent systems; optimisation; bit-valued communications; decentralized algorithm; directed communication graph; distributed learning algorithm; interaction graph; multi-agent welfare optimization; Algorithm design and analysis; Convergence; Games; Markov processes; Resistance; Robots; Wind turbines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control (CDC), 2013 IEEE 52nd Annual Conference on
  • Conference_Location
    Firenze
  • ISSN
    0743-1546
  • Print_ISBN
    978-1-4673-5714-2
  • Type

    conf

  • DOI
    10.1109/CDC.2013.6760240
  • Filename
    6760240