مرکز منطقه ای اطلاع رساني علوم و فناوري - An analysis of optimistic, best-first search for minimax sequential decision making

DocumentCode :

1799307

Title :

An analysis of optimistic, best-first search for minimax sequential decision making

Author :

Busoniu, L. ; Munos, Remi ; Pall, Elod

Author_Institution :

Dept. of Autom., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania

fYear :

2014

fDate :

9-12 Dec. 2014

Firstpage :

Lastpage :

Abstract :

We consider problems in which a maximizer and a minimizer agent take actions in turn, such as games or optimal control with uncertainty modeled as an opponent. We extend the ideas of optimistic optimization to this setting, obtaining a search algorithm that has been previously considered as the best-first search variant of the B* method. We provide a novel analysis of the algorithm relying on a certain structure for the values of action sequences, under which earlier actions are more important than later ones. An asymptotic branching factor is defined as a measure of problem complexity, and it is used to characterize the relationship between computation invested and near-optimality. In particular, when action importance decreases exponentially, convergence rates are obtained. Throughout, examples illustrate analytical concepts such as the branching factor. In an empirical study, we compare the optimistic best-first algorithm with two classical game tree search methods, and apply it to a challenging HIV infection control problem.

Keywords :

decision making; diseases; game theory; optimal control; tree searching; B* method; HIV infection control problem; asymptotic branching factor; best-first search; classical game tree search methods; maximizer agent; minimax sequential decision making; minimizer agent; optimal control; optimistic analysis; optimistic best-first algorithm; optimistic optimization; problem complexity; search algorithm; uncertainty modeled; Algorithm design and analysis; Complexity theory; Games; Optimal control; Optimization; Uncertainty; Upper bound;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on

Conference_Location :

Orlando, FL

Type :

conf

DOI :

10.1109/ADPRL.2014.7010615

Filename :

7010615

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1799307