Reinforcement Distribution in a Team of Cooperative Q-learning Agents

Author

Abbasi, Zahra ; Abbasi, Mohammad Ali

Author_Institution

Islamic Azad Univ., Tehran

fYear

2008

fDate

6-8 Aug. 2008

Firstpage

154

Lastpage

160

Abstract

In a Q-learning multi-agent group, agents cooperate each other to perform their assigned task during their learning for increasing the team performance. If the role of each agent clearly specified -which is a very hard task for a supervisor agent- the team will learn more efficiently. Indeed, in this case each agent reinforced according to its real effect on the team performance. Assuming an identical role for all agents is the most prevalent technique of current researchers to escape the modeling complexities. But we believe this is not the optimum method for reinforcement distribution. The main goal of this research is to find an indirect evaluation method which evaluates the role of each agent in the team and distributes the reinforcement signal accordingly. The expertness of each agent is used as a criterion to estimate the effect of each agentpsilas action on the team performance. Random and equal reinforcement signal distribution methods are also used in order to evaluate expertness-based reinforcement sharing. In addition, a new test bed, called EPIDEM, is developed to evaluate the proposed methods. The results show the distribution of the reinforcement signals based on the proposed method improves the team learning speed.

Keywords

learning (artificial intelligence); multi-agent systems; cooperative Q-learning multi agent group; equal reinforcement signal distribution; indirect evaluation method; random reinforcement signal distribution; reinforcement distribution; team performance; Artificial intelligence; Cooperative systems; Distributed computing; Humans; Multiagent systems; Process design; Protocols; Software engineering; Testing; Uncertainty; Agent learning; Cooperative distributed problem; Coordination; Multiagent Systems; Multiagent learning; and adaptation; and teamwork; cooperation; evolution; solving;

fLanguage

English

Publisher

ieee

Conference_Titel

Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on

Conference_Location

Phuket

Print_ISBN

978-0-7695-3263-9

Type

conf

DOI

10.1109/SNPD.2008.154

Filename

4617364