DocumentCode
268119
Title
Rminimax: An Optimally Randomized MINIMAX Algorithm
Author
DiÌez, Silvia GarciÌa ; Laforge, JeÌroÌ‚me ; Saerens, Marco
Author_Institution
Dept. of Inf. Syst., Univ. catholique de Louvain, Louvain-la-Neuve, Belgium
Volume
43
Issue
1
fYear
2013
fDate
Feb. 2013
Firstpage
385
Lastpage
393
Abstract
This paper proposes a simple extension of the celebrated MINIMAX algorithm used in zero-sum two-player games, called Rminimax. The Rminimax algorithm allows controlling the strength of an artificial rival by randomizing its strategy in an optimal way. In particular, the randomized shortest-path framework is applied for biasing the artificial intelligence (AI) adversary toward worse or better solutions, therefore controlling its strength. In other words, our model aims at introducing/implementing bounded rationality to the MINIMAX algorithm. This framework takes into account all possible strategies by computing an optimal tradeoff between exploration (quantified by the entropy spread in the tree) and exploitation (quantified by the expected cost to an end game) of the game tree. As opposed to other tree-exploration techniques, this new algorithm considers complete paths of a tree (strategies) where a given entropy is spread. The optimal randomized strategy is efficiently computed by means of a simple recurrence relation while keeping the same complexity as the original MINIMAX. As a result, the Rminimax implements a nondeterministic strength-adapted AI opponent for board games in a principled way, thus avoiding the assumption of complete rationality. Simulations on two common games show that Rminimax behaves as expected.
Keywords
artificial intelligence; computational complexity; entropy; game theory; minimax techniques; randomised algorithms; trees (mathematics); Rminimax algorithm; artificial intelligence; board game; bounded rationality; complexity; entropy; exploitation; game tree; nondeterministic strength-adapted AI opponent; optimal randomized strategy; optimally randomized MINIMAX algorithm; randomized shortest-path framework; recurrence relation; tree-exploration technique; zero-sum two-player game; Artificial intelligence; Boltzmann distribution; Computational modeling; Entropy; Equations; Game theory; Games; MINIMAX; randomized shortest paths (RSPs); two-player zero-sum perfect-information games;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TSMCB.2012.2207951
Filename
6261563
Link To Document