DocumentCode
2743540
Title
Reinforcement Distribution in a Team of Cooperative Q-learning Agents
Author
Abbasi, Zahra ; Abbasi, Mohammad Ali
Author_Institution
Islamic Azad Univ., Tehran
fYear
2008
fDate
6-8 Aug. 2008
Firstpage
154
Lastpage
160
Abstract
In a Q-learning multi-agent group, agents cooperate each other to perform their assigned task during their learning for increasing the team performance. If the role of each agent clearly specified -which is a very hard task for a supervisor agent- the team will learn more efficiently. Indeed, in this case each agent reinforced according to its real effect on the team performance. Assuming an identical role for all agents is the most prevalent technique of current researchers to escape the modeling complexities. But we believe this is not the optimum method for reinforcement distribution. The main goal of this research is to find an indirect evaluation method which evaluates the role of each agent in the team and distributes the reinforcement signal accordingly. The expertness of each agent is used as a criterion to estimate the effect of each agentpsilas action on the team performance. Random and equal reinforcement signal distribution methods are also used in order to evaluate expertness-based reinforcement sharing. In addition, a new test bed, called EPIDEM, is developed to evaluate the proposed methods. The results show the distribution of the reinforcement signals based on the proposed method improves the team learning speed.
Keywords
learning (artificial intelligence); multi-agent systems; cooperative Q-learning multi agent group; equal reinforcement signal distribution; indirect evaluation method; random reinforcement signal distribution; reinforcement distribution; team performance; Artificial intelligence; Cooperative systems; Distributed computing; Humans; Multiagent systems; Process design; Protocols; Software engineering; Testing; Uncertainty; Agent learning; Cooperative distributed problem; Coordination; Multiagent Systems; Multiagent learning; and adaptation; and teamwork; cooperation; evolution; solving;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on
Conference_Location
Phuket
Print_ISBN
978-0-7695-3263-9
Type
conf
DOI
10.1109/SNPD.2008.154
Filename
4617364
Link To Document