مرکز منطقه ای اطلاع رساني علوم و فناوري - Globally Optimal Multi-agent Reinforcement Learning Parameters in Distributed Task Assignment

DocumentCode :

1839408

Title :

Globally Optimal Multi-agent Reinforcement Learning Parameters in Distributed Task Assignment

Author :

Dahlem, Dominik ; Harrison, William

Volume :

fYear :

2009

fDate :

15-18 Sept. 2009

Firstpage :

Lastpage :

Abstract :

Large-scale simulation studies are necessary to study the learning behaviour of individual agents and the overall system dynamics. One reason is that planning algorithms to find optimal solutions to fully observable general decentralised Markov decision problems do not admit to polynomial-time worst-case complexity bounds. Additionally, agent interaction often implies a non-stationary environment which does not lend itself to asymptotically greedy policies. Therefore, policies with a constant level of exploration are required to be able to adapt continuously. This paper casts the application domain of distributed task assignment into the formalisms of queueing theory, complex networks and decentralised Markov decision problems to analyse the impact of the momentum of a standard back-propagation neural network function approximator and the discount factor of $SARSA(0)$ reinforcement learning and the $epsilon$ parameter of the $epsilon$-greedy policy. For this purpose large queueing networks of one thousand interacting agents are evolved. A Kriging metamodel is fitted and in combination with simulated annealing optimal operating conditions with respect to the total average response time are found. The insights gained from this study are significant in that they provide guidance in deploying large-scale distributed task assignment systems modelled as multi-agent queueing networks.

Keywords :

Computational modeling; Computer science; Computer simulation; Conferences; Delay; Educational institutions; Intelligent agent; Learning; Network servers; Statistical distributions; Kriging; Markov Decision Problem; Multi-agent Reinforcement Learning; Queueing Networks;

fLanguage :

English

Publisher :

iet

Conference_Titel :

Web Intelligence and Intelligent Agent Technologies, 2009. WI-IAT '09. IEEE/WIC/ACM International Joint Conferences on

Conference_Location :

Milan, Italy

Print_ISBN :

978-0-7695-3801-3

Electronic_ISBN :

978-1-4244-5331-3

Type :

conf

DOI :

10.1109/WI-IAT.2009.122

Filename :

5284868

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1839408