Title of article
Optimal Bayesian strategies for the infinite-armed Bernoulli bandit
Author/Authors
Hung، نويسنده , , Ying-Chao، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2012
Pages
9
From page
86
To page
94
Abstract
We consider the bandit problem with an infinite number of Bernoulli arms, of which the unknown parameters are assumed to be i.i.d. random variables with a common distribution F. Our goal is to construct optimal strategies of choosing “arms” so that the expected long-run failure rate is minimized. We first review a class of strategies and establish their asymptotic properties when F is known. Based on the results, we propose a new strategy and prove that it is asymptotically optimal when F is unknown. Finally, we show that the proposed strategy performs well for a number of simulation scenarios.
Keywords
Bandit problem , Bernoulli arms , Prior distribution , Bayesian strategy
Journal title
Journal of Statistical Planning and Inference
Serial Year
2012
Journal title
Journal of Statistical Planning and Inference
Record number
2221690
Link To Document