• DocumentCode
    268119
  • Title

    Rminimax: An Optimally Randomized MINIMAX Algorithm

  • Author

    Díez, Silvia García ; Laforge, Jérôme ; Saerens, Marco

  • Author_Institution
    Dept. of Inf. Syst., Univ. catholique de Louvain, Louvain-la-Neuve, Belgium
  • Volume
    43
  • Issue
    1
  • fYear
    2013
  • fDate
    Feb. 2013
  • Firstpage
    385
  • Lastpage
    393
  • Abstract
    This paper proposes a simple extension of the celebrated MINIMAX algorithm used in zero-sum two-player games, called Rminimax. The Rminimax algorithm allows controlling the strength of an artificial rival by randomizing its strategy in an optimal way. In particular, the randomized shortest-path framework is applied for biasing the artificial intelligence (AI) adversary toward worse or better solutions, therefore controlling its strength. In other words, our model aims at introducing/implementing bounded rationality to the MINIMAX algorithm. This framework takes into account all possible strategies by computing an optimal tradeoff between exploration (quantified by the entropy spread in the tree) and exploitation (quantified by the expected cost to an end game) of the game tree. As opposed to other tree-exploration techniques, this new algorithm considers complete paths of a tree (strategies) where a given entropy is spread. The optimal randomized strategy is efficiently computed by means of a simple recurrence relation while keeping the same complexity as the original MINIMAX. As a result, the Rminimax implements a nondeterministic strength-adapted AI opponent for board games in a principled way, thus avoiding the assumption of complete rationality. Simulations on two common games show that Rminimax behaves as expected.
  • Keywords
    artificial intelligence; computational complexity; entropy; game theory; minimax techniques; randomised algorithms; trees (mathematics); Rminimax algorithm; artificial intelligence; board game; bounded rationality; complexity; entropy; exploitation; game tree; nondeterministic strength-adapted AI opponent; optimal randomized strategy; optimally randomized MINIMAX algorithm; randomized shortest-path framework; recurrence relation; tree-exploration technique; zero-sum two-player game; Artificial intelligence; Boltzmann distribution; Computational modeling; Entropy; Equations; Game theory; Games; MINIMAX; randomized shortest paths (RSPs); two-player zero-sum perfect-information games;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TSMCB.2012.2207951
  • Filename
    6261563