Title :
Multiagent reinforcement learning method with an improved ant colony system
Author :
Sun, Ruoying ; Tatsumi, Shoji ; Zhao, Gang
Author_Institution :
Fac. of Eng., Osaka City Univ., Japan
Abstract :
Multiagent reinforcement learning has gained increasing attention in recent years. The authors discuss coordination means for sharing episodes and sharing policies in the field of multiagent reinforcement learning. From the point of the view of reinforcement learning, we analyse the performance of indirect media communication among multi-agents on an ant colony system which is an efficient method that uses pheromones to solve optimization problems. Based on the above, we propose the Q-ACS method, modifying the global updating rule in ACS for learning agents to share better episodes benefited from the exploitation of accumulated knowledge. Meanwhile, taking the visited times into account, we propose T-ACS by presenting a state transition policy for learning agents to share better policies, benefiting from biased exploration. To demonstrate the coordination performance of learning agents in our methods, we conducted experiments on an optimization problem, the traveling salesman problem. Comparison of results with ACS, Q-ACS and T-ACS show that the improved methods are efficient for solving the optimization problem
Keywords :
learning (artificial intelligence); multi-agent systems; travelling salesman problems; ACS; Q-ACS method; T-ACS; accumulated knowledge; ant colony system; biased exploration; coordination means; coordination performance; global updating rule; indirect media communication; learning agents; multiagent reinforcement learning; optimization problems; pheromones; state transition policy; Ant colony optimization; Distributed computing; Explosives; Internet; Learning; Optimization methods; Personal digital assistants; Sun; Telephony; Traveling salesman problems;
Conference_Titel :
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location :
Tucson, AZ
Print_ISBN :
0-7803-7087-2
DOI :
10.1109/ICSMC.2001.973515