DocumentCode :
2454337
Title :
On-Line Adaptation of Exploration in the One-Armed Bandit with Covariates Problem
Author :
Sykulski, Adam M. ; Adams, Niall M. ; Jennings, Nicholas R.
Author_Institution :
Inst. for Math. Sci., Imperial Coll. London, London, UK
fYear :
2010
fDate :
12-14 Dec. 2010
Firstpage :
459
Lastpage :
464
Abstract :
Many sequential decision making problems require an agent to balance exploration and exploitation to maximise long-term reward. Existing policies that address this tradeoff typically have parameters that are set a priori to control the amount of exploration. In finite-time problems, the optimal values of these parameters are highly dependent on the problem faced. In this paper, we propose adapting the amount of exploration performed on-line, as information is gathered by the agent. To this end we introduce a novel algorithm, e-ADAPT, which has no free parameters. The algorithm adapts as it plays and sequentially chooses whether to explore or exploit, driven by the amount of uncertainty in the system. We provide simulation results for the one armed bandit with covariates problem, which demonstrate the effectiveness of e-ADAPT to correctly control the amount of exploration in finite-time problems and yield rewards that are close to optimally tuned off-line policies. Furthermore, we show that e-ADAPT is robust to a high-dimensional covariate, as well as misspecified models. Finally, we describe how our methods could be extended to other sequential decision making problems, such as dynamic bandit problems with changing reward structures.
Keywords :
covariance analysis; decision making; decision theory; iterative methods; multi-agent systems; optimisation; ε-ADAPT; decision making; exploitation; exploration; finite-time problems; one-armed bandit; online adaptation; Approximation methods; Estimation; Games; Machine learning; Optimized production technology; Radio frequency; Uncertainty; Exploration-exploitation tradeoff; on-line learning; one-armed bandit problem; sequential decision making;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-9211-4
Type :
conf
DOI :
10.1109/ICMLA.2010.74
Filename :
5708871
Link To Document :
بازگشت