DocumentCode :
3528136
Title :
A distributed learning algorithm with bit-valued communications for multi-agent welfare optimization
Author :
Menon, Ashok ; Baras, John S.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Maryland, College Park, MD, USA
fYear :
2013
fDate :
10-13 Dec. 2013
Firstpage :
2406
Lastpage :
2411
Abstract :
A multi-agent system comprising N agents, each picking actions from a finite set and receiving a payoff that depends on the action of the whole, is considered. The exact form of the payoffs are unknown and only their values can be measured by the respective agents. A decentralized algorithm was proposed by Marden et al. [1] and in the authors´ earlier work [2] that, in this setting, leads to the agents picking welfare optimizing actions under some restrictive assumptions on the payoff structure. This algorithm is modified in this paper to incorporate exchange of certain bit-valued information between the agents over a directed communication graph. The notion of an interaction graph is then introduced to encode known interaction in the system. Restrictions on the payoff structure are eliminated and conditions that guarantee convergence to welfare minimizing actions w.p. 1 are derived under the assumption that the union of the interaction graph and communication graph is strongly connected.
Keywords :
directed graphs; learning (artificial intelligence); multi-agent systems; optimisation; bit-valued communications; decentralized algorithm; directed communication graph; distributed learning algorithm; interaction graph; multi-agent welfare optimization; Algorithm design and analysis; Convergence; Games; Markov processes; Resistance; Robots; Wind turbines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control (CDC), 2013 IEEE 52nd Annual Conference on
Conference_Location :
Firenze
ISSN :
0743-1546
Print_ISBN :
978-1-4673-5714-2
Type :
conf
DOI :
10.1109/CDC.2013.6760240
Filename :
6760240
Link To Document :
بازگشت