DocumentCode
3528136
Title
A distributed learning algorithm with bit-valued communications for multi-agent welfare optimization
Author
Menon, Ashok ; Baras, John S.
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Maryland, College Park, MD, USA
fYear
2013
fDate
10-13 Dec. 2013
Firstpage
2406
Lastpage
2411
Abstract
A multi-agent system comprising N agents, each picking actions from a finite set and receiving a payoff that depends on the action of the whole, is considered. The exact form of the payoffs are unknown and only their values can be measured by the respective agents. A decentralized algorithm was proposed by Marden et al. [1] and in the authors´ earlier work [2] that, in this setting, leads to the agents picking welfare optimizing actions under some restrictive assumptions on the payoff structure. This algorithm is modified in this paper to incorporate exchange of certain bit-valued information between the agents over a directed communication graph. The notion of an interaction graph is then introduced to encode known interaction in the system. Restrictions on the payoff structure are eliminated and conditions that guarantee convergence to welfare minimizing actions w.p. 1 are derived under the assumption that the union of the interaction graph and communication graph is strongly connected.
Keywords
directed graphs; learning (artificial intelligence); multi-agent systems; optimisation; bit-valued communications; decentralized algorithm; directed communication graph; distributed learning algorithm; interaction graph; multi-agent welfare optimization; Algorithm design and analysis; Convergence; Games; Markov processes; Resistance; Robots; Wind turbines;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control (CDC), 2013 IEEE 52nd Annual Conference on
Conference_Location
Firenze
ISSN
0743-1546
Print_ISBN
978-1-4673-5714-2
Type
conf
DOI
10.1109/CDC.2013.6760240
Filename
6760240
Link To Document