مرکز منطقه ای اطلاع رساني علوم و فناوري - αβ-based play-outs in Monte-Carlo Tree Search

DocumentCode :

3476651

Title :

αβ-based play-outs in Monte-Carlo Tree Search

Author :

Winands, Mark H. M. ; Bjornsson, Yngvi

Author_Institution :

Dept. of Knowledge Eng., Maastricht Univ., Maastricht, Netherlands

fYear :

2011

fDate :

Aug. 31 2011-Sept. 3 2011

Firstpage :

110

Lastpage :

117

Abstract :

Monte-Carlo Tree Search (MCTS) is a recent paradigm for game-tree search, which gradually builds a game-tree in a best-first fashion based on the results of randomized simulation play-outs. The performance of such an approach is highly dependent on both the total number of simulation play-outs and their quality. The two metrics are, however, typically inversely correlated - improving the quality of the play-outs generally involves adding knowledge that requires extra computation, thus allowing fewer play-outs to be performed per time unit. The general practice in MCTS seems to be more towards using relatively knowledge-light play-out strategies for the benefit of getting additional simulations done. In this paper we show, for the game Lines of Action (LOA), that this is not necessarily the best strategy. The newest version of our simulation-based LOA program, MC-LOA_αβ, uses a selective 2-ply αβ-search at each step in its play-outs for choosing a move. Even though this reduces the number of simulations by more than a factor of two, the new version outperforms previous versions by a large margin - achieving a winning score of approximately 60%.

Keywords :

Monte Carlo methods; game theory; tree searching; αβ based play outs; Monte Carlo tree search; game tree search; knowledge light play out strategies; lines of action; randomized simulation play outs; Backpropagation; Computational intelligence; Computational modeling; Computers; Games; Monte Carlo methods; Probability;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computational Intelligence and Games (CIG), 2011 IEEE Conference on

Conference_Location :

Seoul

Print_ISBN :

978-1-4577-0010-1

Electronic_ISBN :

978-1-4577-0009-5

Type :

conf

DOI :

10.1109/CIG.2011.6031996

Filename :

6031996

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3476651