• DocumentCode
    1799307
  • Title

    An analysis of optimistic, best-first search for minimax sequential decision making

  • Author

    Busoniu, L. ; Munos, Remi ; Pall, Elod

  • Author_Institution
    Dept. of Autom., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    We consider problems in which a maximizer and a minimizer agent take actions in turn, such as games or optimal control with uncertainty modeled as an opponent. We extend the ideas of optimistic optimization to this setting, obtaining a search algorithm that has been previously considered as the best-first search variant of the B* method. We provide a novel analysis of the algorithm relying on a certain structure for the values of action sequences, under which earlier actions are more important than later ones. An asymptotic branching factor is defined as a measure of problem complexity, and it is used to characterize the relationship between computation invested and near-optimality. In particular, when action importance decreases exponentially, convergence rates are obtained. Throughout, examples illustrate analytical concepts such as the branching factor. In an empirical study, we compare the optimistic best-first algorithm with two classical game tree search methods, and apply it to a challenging HIV infection control problem.
  • Keywords
    decision making; diseases; game theory; optimal control; tree searching; B* method; HIV infection control problem; asymptotic branching factor; best-first search; classical game tree search methods; maximizer agent; minimax sequential decision making; minimizer agent; optimal control; optimistic analysis; optimistic best-first algorithm; optimistic optimization; problem complexity; search algorithm; uncertainty modeled; Algorithm design and analysis; Complexity theory; Games; Optimal control; Optimization; Uncertainty; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
  • Conference_Location
    Orlando, FL
  • Type

    conf

  • DOI
    10.1109/ADPRL.2014.7010615
  • Filename
    7010615