Title :
Heuristic-Based Multi-Agent Monte Carlo Tree Search
Author :
Galvan-Lopez, Edgar ; Ruohua Li ; Patsakis, Constantinos ; Clarke, Steven ; Cahill, Vinny
Author_Institution :
Distrib. Syst. Group, Trinity Coll. Dublin, Dublin, Ireland
Abstract :
Monte Carlo Tree Search (MCTS) is a relatively new sampling best-first method to search for optimal decisions. The MCTS´ popularity is based on its extraordinary results in the challenging two-player based game Go, a game considered much harder than Chess and that until very recently was considered unfeasible for Artificial Intelligence methods. Different MCTS variants have been proposed, mainly to enhance its capabilities. Perhaps, one of the main limitations of this approach is its applicability in scenarios where multiple agents (more than two) are required. Some works have made an attempt to overcome this limitation by using a vector of reward values for each agent and allowing the algorithm to find an optimal equilibrium strategy. Inspired by these approaches, in this work we make an effort to explore a new proposal for handling multiple agents in MCTS by using a vector of values of what the agents need to do (defined tasks) instead of a vector of rewards for each agent. To achieve this we use a rather simple, but powerful heuristic that estimates the desired values of this vector. That is, a set of values that could lead to the optimal completion of the task. We tested this idea in a real-world scenario rather than using it in games as traditionally done. The results achieved by our proposed approach, named Heuristic-Based Multi-Agent Monte Carlo Tree Search, indicate the feasibility of using heuristics in the MCTS algorithm in situations where more than two agents are required.
Keywords :
Monte Carlo methods; game theory; multi-agent systems; sampling methods; tree searching; vectors; Go; MCTS algorithm; artificial intelligence methods; heuristic-based multi-agent Monte Carlo tree search; multiple agents; optimal equilibrium strategy; real-world scenario; reward values; sampling best-first method; two-player based game; vector; Bars; Batteries; Games; Monte Carlo methods; Peak to average power ratio; System-on-chip; Vectors; Demand-Side Management Systems; Heuristics; Monte Carlo Tree Search;
Conference_Titel :
Information, Intelligence, Systems and Applications, IISA 2014, The 5th International Conference on
Conference_Location :
Chania
DOI :
10.1109/IISA.2014.6878747