• DocumentCode
    2240141
  • Title

    UCD: Upper Confidence Bound for Rooted Directed Acyclic Graphs

  • Author

    Saffidine, Abdallah ; Cazenave, Tristan ; Méhat, Jean

  • Author_Institution
    LAMSADE, Univ. Paris-Dauphine, Paris, France
  • fYear
    2010
  • fDate
    18-20 Nov. 2010
  • Firstpage
    467
  • Lastpage
    473
  • Abstract
    In this paper we present a framework for testing various algorithms that deal with transpositions in Monte-Carlo Tree Search (MCTS). When using transpositions in MCTS, a Directed Acyclic Graph (DAG) is progressively developed instead of a tree. There are multiple ways to handle the exploration-exploitation dilemma when dealing with transpositions. We propose parameterized ways to compute the mean of the child, the playouts of the parent, and the playouts of the child. We test the resulting algorithms on LeftRight, an abstract single-player game, and on Hex. For both games, original configurations of our algorithms improve on state-of-the-art algorithms.
  • Keywords
    Monte Carlo methods; computer games; directed graphs; trees (mathematics); Hex game; LeftRight game; Monte Carlo tree search; rooted directed acyclic graphs; DAG; Monte-Carlo Tree Search; Transpositions; UCT
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Technologies and Applications of Artificial Intelligence (TAAI), 2010 International Conference on
  • Conference_Location
    Hsinchu City
  • Print_ISBN
    978-1-4244-8668-7
  • Electronic_ISBN
    978-0-7695-4253-9
  • Type

    conf

  • DOI
    10.1109/TAAI.2010.79
  • Filename
    5695494