Title :
An analysis of optimistic, best-first search for minimax sequential decision making
Author :
Busoniu, L. ; Munos, Remi ; Pall, Elod
Author_Institution :
Dept. of Autom., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
Abstract :
We consider problems in which a maximizer and a minimizer agent take actions in turn, such as games or optimal control with uncertainty modeled as an opponent. We extend the ideas of optimistic optimization to this setting, obtaining a search algorithm that has been previously considered as the best-first search variant of the B* method. We provide a novel analysis of the algorithm relying on a certain structure for the values of action sequences, under which earlier actions are more important than later ones. An asymptotic branching factor is defined as a measure of problem complexity, and it is used to characterize the relationship between computation invested and near-optimality. In particular, when action importance decreases exponentially, convergence rates are obtained. Throughout, examples illustrate analytical concepts such as the branching factor. In an empirical study, we compare the optimistic best-first algorithm with two classical game tree search methods, and apply it to a challenging HIV infection control problem.
Keywords :
decision making; diseases; game theory; optimal control; tree searching; B* method; HIV infection control problem; asymptotic branching factor; best-first search; classical game tree search methods; maximizer agent; minimax sequential decision making; minimizer agent; optimal control; optimistic analysis; optimistic best-first algorithm; optimistic optimization; problem complexity; search algorithm; uncertainty modeled; Algorithm design and analysis; Complexity theory; Games; Optimal control; Optimization; Uncertainty; Upper bound;
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/ADPRL.2014.7010615