DocumentCode
1799307
Title
An analysis of optimistic, best-first search for minimax sequential decision making
Author
Busoniu, L. ; Munos, Remi ; Pall, Elod
Author_Institution
Dept. of Autom., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear
2014
fDate
9-12 Dec. 2014
Firstpage
1
Lastpage
8
Abstract
We consider problems in which a maximizer and a minimizer agent take actions in turn, such as games or optimal control with uncertainty modeled as an opponent. We extend the ideas of optimistic optimization to this setting, obtaining a search algorithm that has been previously considered as the best-first search variant of the B* method. We provide a novel analysis of the algorithm relying on a certain structure for the values of action sequences, under which earlier actions are more important than later ones. An asymptotic branching factor is defined as a measure of problem complexity, and it is used to characterize the relationship between computation invested and near-optimality. In particular, when action importance decreases exponentially, convergence rates are obtained. Throughout, examples illustrate analytical concepts such as the branching factor. In an empirical study, we compare the optimistic best-first algorithm with two classical game tree search methods, and apply it to a challenging HIV infection control problem.
Keywords
decision making; diseases; game theory; optimal control; tree searching; B* method; HIV infection control problem; asymptotic branching factor; best-first search; classical game tree search methods; maximizer agent; minimax sequential decision making; minimizer agent; optimal control; optimistic analysis; optimistic best-first algorithm; optimistic optimization; problem complexity; search algorithm; uncertainty modeled; Algorithm design and analysis; Complexity theory; Games; Optimal control; Optimization; Uncertainty; Upper bound;
fLanguage
English
Publisher
ieee
Conference_Titel
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location
Orlando, FL
Type
conf
DOI
10.1109/ADPRL.2014.7010615
Filename
7010615
Link To Document