DocumentCode
6116
Title
Monte Carlo Search Algorithm Discovery for Single-Player Games
Author
Maes, Frederik ; St-Pierre, David L. ; Ernst, Damien
Author_Institution
Syst. & Modeling Res. Unit, Univ. of Liege, Liege, Belgium
Volume
5
Issue
3
fYear
2013
fDate
Sept. 2013
Firstpage
201
Lastpage
213
Abstract
Much current research in AI and games is being devoted to Monte Carlo search (MCS) algorithms. While the quest for a single unified MCS algorithm that would perform well on all problems is of major interest for AI, practitioners often know in advance the problem they want to solve, and spend plenty of time exploiting this knowledge to customize their MCS algorithm in a problem-driven way. We propose an MCS algorithm discovery scheme to perform this in an automatic and reproducible way. First, we introduce a grammar over MCS algorithms that enables inducing a rich space of candidate algorithms. Afterwards, we search in this space for the algorithm that performs best on average for a given distribution of training problems. We rely on multiarmed bandits to approximately solve this optimization problem. The experiments, generated on three different domains, show that our approach enables discovering algorithms that outperform several well-known MCS algorithms such as upper confidence bounds applied to trees and nested Monte Carlo search. We also show that the discovered algorithms are generally quite robust with respect to changes in the distribution over the training problems.
Keywords
Monte Carlo methods; algorithm theory; game theory; optimisation; search problems; MCS algorithm; Monte Carlo search algorithm; artificial intelligence; confidence bound; optimization problem; single-player game; Approximation algorithms; Games; Generators; Grammar; Monte Carlo methods; Search problems; Training; Algorithm selection; Monte Carlo search (MCS); grammar of algorithms;
fLanguage
English
Journal_Title
Computational Intelligence and AI in Games, IEEE Transactions on
Publisher
ieee
ISSN
1943-068X
Type
jour
DOI
10.1109/TCIAIG.2013.2239295
Filename
6409426
Link To Document