مرکز منطقه ای اطلاع رساني علوم و فناوري - N-Grams and the Last-Good-Reply Policy Applied in General Game Playing

DocumentCode :

1520704

Title :

N-Grams and the Last-Good-Reply Policy Applied in General Game Playing

Author :

Tak, Mandy J W ; Winands, Mark H M ; Björnsson, Yngvi

Author_Institution :

Dept. of Knowledge Eng., Maastricht Univ., Maastricht, Netherlands

Volume :

Issue :

fYear :

2012

fDate :

6/1/2012 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

The aim of general game playing (GGP) is to create programs capable of playing a wide range of different games at an expert level, given only the rules of the game. The most successful GGP programs currently employ simulation-based Monte Carlo tree search (MCTS). The performance of MCTS depends heavily on the simulation strategy used. In this paper, we introduce improved simulation strategies for GGP that we implement and test in the GGP agent CADIAPLAYER, which won the International GGP competition in both 2007 and 2008. There are two aspects to the improvements: first, we show that a simple ϵ-greedy exploration strategy works better in the simulation play-outs than the softmax-based Gibbs measure currently used in CADIAPLAYER and, second, we introduce a general framework based on N-grams for learning promising move sequences. Collectively, these enhancements result in a much improved performance of CADIAPLAYER. For example, in our test suite consisting of five different two-player turn-based games, they led to an impressive average win rate of approximately 70%. The enhancements are also shown to be effective in multiplayer and simultaneous-move games. We additionally perform experiments with the last-good-reply policy (LGRP). The LGRP combined with N-grams is also tested. The LGRP has already been shown to be successful in Go programs and we demonstrate that it also has promise in GGP.

Keywords :

Monte Carlo methods; computer games; greedy algorithms; learning (artificial intelligence); tree searching; ϵ-greedy exploration strategy; CADIAPLAYER; GGP programs; Go program; International GGP competition; LGRP; MCTS; Monte Carlo tree search; N-gram; game rule; general game playing; last good reply policy; learning; simulation strategy; Computational modeling; Games; Law; Learning systems; Monte Carlo methods; Servers; General game playing (GGP); Monte Carlo tree search (MCTS); N-grams; last-good-reply policy (LGRP);

fLanguage :

English

Journal_Title :

Computational Intelligence and AI in Games, IEEE Transactions on

Publisher :

ieee

ISSN :

1943-068X

Type :

jour

DOI :

10.1109/TCIAIG.2012.2200252

Filename :

6203383

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1520704