Rminimax: An Optimally Randomized MINIMAX Algorithm

Author

DiÌez, Silvia GarciÌa ; Laforge, JeÌroÌ‚me ; Saerens, Marco

Author_Institution

Dept. of Inf. Syst., Univ. catholique de Louvain, Louvain-la-Neuve, Belgium

Volume

43

Issue

1

fYear

2013

fDate

Feb. 2013

Firstpage

385

Lastpage

393

Abstract

This paper proposes a simple extension of the celebrated MINIMAX algorithm used in zero-sum two-player games, called Rminimax. The Rminimax algorithm allows controlling the strength of an artificial rival by randomizing its strategy in an optimal way. In particular, the randomized shortest-path framework is applied for biasing the artificial intelligence (AI) adversary toward worse or better solutions, therefore controlling its strength. In other words, our model aims at introducing/implementing bounded rationality to the MINIMAX algorithm. This framework takes into account all possible strategies by computing an optimal tradeoff between exploration (quantified by the entropy spread in the tree) and exploitation (quantified by the expected cost to an end game) of the game tree. As opposed to other tree-exploration techniques, this new algorithm considers complete paths of a tree (strategies) where a given entropy is spread. The optimal randomized strategy is efficiently computed by means of a simple recurrence relation while keeping the same complexity as the original MINIMAX. As a result, the Rminimax implements a nondeterministic strength-adapted AI opponent for board games in a principled way, thus avoiding the assumption of complete rationality. Simulations on two common games show that Rminimax behaves as expected.

Keywords

artificial intelligence; computational complexity; entropy; game theory; minimax techniques; randomised algorithms; trees (mathematics); Rminimax algorithm; artificial intelligence; board game; bounded rationality; complexity; entropy; exploitation; game tree; nondeterministic strength-adapted AI opponent; optimal randomized strategy; optimally randomized MINIMAX algorithm; randomized shortest-path framework; recurrence relation; tree-exploration technique; zero-sum two-player game; Artificial intelligence; Boltzmann distribution; Computational modeling; Entropy; Equations; Game theory; Games; MINIMAX; randomized shortest paths (RSPs); two-player zero-sum perfect-information games;

fLanguage

English

Journal_Title

Cybernetics, IEEE Transactions on

Publisher

ieee

ISSN

2168-2267

Type

jour

DOI

10.1109/TSMCB.2012.2207951

Filename

6261563

Rminimax: An Optimally Randomized MINIMAX Algorithm

DiÌez, Silvia GarciÌa ; Laforge, JeÌroÌ‚me ; Saerens, Marco

jour

DiÌez, Silvia GarciÌa ; Laforge, JeÌroÌ‚me ; Saerens, Marco