DocumentCode :
1784732
Title :
Heuristic-Based Multi-Agent Monte Carlo Tree Search
Author :
Galvan-Lopez, Edgar ; Ruohua Li ; Patsakis, Constantinos ; Clarke, Steven ; Cahill, Vinny
Author_Institution :
Distrib. Syst. Group, Trinity Coll. Dublin, Dublin, Ireland
fYear :
2014
fDate :
7-9 July 2014
Firstpage :
177
Lastpage :
182
Abstract :
Monte Carlo Tree Search (MCTS) is a relatively new sampling best-first method to search for optimal decisions. The MCTS´ popularity is based on its extraordinary results in the challenging two-player based game Go, a game considered much harder than Chess and that until very recently was considered unfeasible for Artificial Intelligence methods. Different MCTS variants have been proposed, mainly to enhance its capabilities. Perhaps, one of the main limitations of this approach is its applicability in scenarios where multiple agents (more than two) are required. Some works have made an attempt to overcome this limitation by using a vector of reward values for each agent and allowing the algorithm to find an optimal equilibrium strategy. Inspired by these approaches, in this work we make an effort to explore a new proposal for handling multiple agents in MCTS by using a vector of values of what the agents need to do (defined tasks) instead of a vector of rewards for each agent. To achieve this we use a rather simple, but powerful heuristic that estimates the desired values of this vector. That is, a set of values that could lead to the optimal completion of the task. We tested this idea in a real-world scenario rather than using it in games as traditionally done. The results achieved by our proposed approach, named Heuristic-Based Multi-Agent Monte Carlo Tree Search, indicate the feasibility of using heuristics in the MCTS algorithm in situations where more than two agents are required.
Keywords :
Monte Carlo methods; game theory; multi-agent systems; sampling methods; tree searching; vectors; Go; MCTS algorithm; artificial intelligence methods; heuristic-based multi-agent Monte Carlo tree search; multiple agents; optimal equilibrium strategy; real-world scenario; reward values; sampling best-first method; two-player based game; vector; Bars; Batteries; Games; Monte Carlo methods; Peak to average power ratio; System-on-chip; Vectors; Demand-Side Management Systems; Heuristics; Monte Carlo Tree Search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information, Intelligence, Systems and Applications, IISA 2014, The 5th International Conference on
Conference_Location :
Chania
Type :
conf
DOI :
10.1109/IISA.2014.6878747
Filename :
6878747
Link To Document :
بازگشت