DocumentCode :
3255722
Title :
Thompson Sampling for Dynamic Multi-armed Bandits
Author :
Gupta, Neha ; Granmo, Ole-Christoffer ; Agrawala, Ashok
Author_Institution :
Dept. of Comput. Sci., Univ. of Maryland, College Park, MD, USA
Volume :
1
fYear :
2011
fDate :
18-21 Dec. 2011
Firstpage :
484
Lastpage :
489
Abstract :
The importance of multi-armed bandit (MAB) problems is on the rise due to their recent application in a large variety of areas such as online advertising, news article selection, wireless networks, and medicinal trials, to name a few. The most common assumption made when solving such MAB problems is that the unknown reward probability θk of each bandit arm k is fixed. However, this assumption rarely holds in practice simply because real-life problems often involve underlying processes that are dynamically evolving. In this paper, we model problems where reward probabilities θk are drifting, and introduce a new method called Dynamic Thompson Sampling (DTS) that facilitates Order Statistics based Thompson Sampling for these dynamically evolving MABs. The DTS algorithm adapts its success probability estimates, hat θk, faster than traditional Thompson Sampling schemes and thus leads to improved performance in terms of lower regret. Extensive experiments demonstrate that DTS outperforms current state-of-the-art approaches, namely pure Thompson Sampling, UCB-Normal and UCBf, for the case of dynamic reward probabilities. Furthermore, this performance advantage increases persistently with the number of bandit arms.
Keywords :
learning (artificial intelligence); probability; sampling methods; DTS; MAB problem; UCB-Normal; UCBf; dynamic Thompson sampling; dynamic multiarmed bandit; order statistics; reward probability; Bayesian methods; Dynamics; Educational institutions; Electronic mail; Heuristic algorithms; Random variables; Bayesian Techniques; Learning Algorithms; Multi-Armed Bandits;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4577-2134-2
Type :
conf
DOI :
10.1109/ICMLA.2011.144
Filename :
6147024
Link To Document :
بازگشت