DocumentCode :
2665341
Title :
Tree Exploration for Bayesian RL Exploration
Author :
Dimitrakakis, Christos
Author_Institution :
Intell. Syst. Lab. Amsterdam, Univ. of Amsterdam, Amsterdam, Netherlands
fYear :
2008
fDate :
10-12 Dec. 2008
Firstpage :
1029
Lastpage :
1034
Abstract :
Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The first employs a Bayesian framework, where optimality improves with increased computational time. This is because the resulting planning task takes the form of a dynamic programming problem on a be-lief tree with an infinite number of states. The second type employs relatively simple algorithm which are shown to suffer small regret within a distribution-free framework. This paper presents a lower bound and a high probability up-per bound on the optimal value function for the nodes in the Bayesian belief tree, which are analogous to similar bounds in POMDPs. The bounds are then used to create more efficient strategies for exploring the tree. The resulting algorithms are compared with the distribution-free algorithm UCB1, as well as a simpler baseline algorithm on multi-armed bandit problems.
Keywords :
belief networks; decision making; dynamic programming; learning (artificial intelligence); trees (mathematics); uncertainty handling; Bayesian RL exploration; Bayesian belief tree; POMDP; UCB1; computational time; distribution-free algorithm; dynamic programming problem; multiarmed bandit problem; optimal decision making; optimal value function; partially-observable Markov decision process; reinforcement learning; tree exploration; uncertainty; Algorithm design and analysis; Bayesian methods; Computational intelligence; Decision making; Dynamic programming; Intelligent systems; Laboratories; Learning; Uncertainty; Upper bound; Bayesian; Exploration; Reinforcement Learning; Tree search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence for Modelling Control & Automation, 2008 International Conference on
Conference_Location :
Vienna
Print_ISBN :
978-0-7695-3514-2
Type :
conf
DOI :
10.1109/CIMCA.2008.32
Filename :
5172767
Link To Document :
بازگشت