Title :
αβ-based play-outs in Monte-Carlo Tree Search
Author :
Winands, Mark H. M. ; Bjornsson, Yngvi
Author_Institution :
Dept. of Knowledge Eng., Maastricht Univ., Maastricht, Netherlands
fDate :
Aug. 31 2011-Sept. 3 2011
Abstract :
Monte-Carlo Tree Search (MCTS) is a recent paradigm for game-tree search, which gradually builds a game-tree in a best-first fashion based on the results of randomized simulation play-outs. The performance of such an approach is highly dependent on both the total number of simulation play-outs and their quality. The two metrics are, however, typically inversely correlated - improving the quality of the play-outs generally involves adding knowledge that requires extra computation, thus allowing fewer play-outs to be performed per time unit. The general practice in MCTS seems to be more towards using relatively knowledge-light play-out strategies for the benefit of getting additional simulations done. In this paper we show, for the game Lines of Action (LOA), that this is not necessarily the best strategy. The newest version of our simulation-based LOA program, MC-LOAαβ, uses a selective 2-ply αβ-search at each step in its play-outs for choosing a move. Even though this reduces the number of simulations by more than a factor of two, the new version outperforms previous versions by a large margin - achieving a winning score of approximately 60%.
Keywords :
Monte Carlo methods; game theory; tree searching; αβ based play outs; Monte Carlo tree search; game tree search; knowledge light play out strategies; lines of action; randomized simulation play outs; Backpropagation; Computational intelligence; Computational modeling; Computers; Games; Monte Carlo methods; Probability;
Conference_Titel :
Computational Intelligence and Games (CIG), 2011 IEEE Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4577-0010-1
Electronic_ISBN :
978-1-4577-0009-5
DOI :
10.1109/CIG.2011.6031996