DocumentCode
239131
Title
Differential Evolution algorithm applied to non-stationary bandit problem
Author
St-Pierre, David L. ; Jialin Liu
Author_Institution
Univ. of Liege, Liege, Belgium
fYear
2014
fDate
6-11 July 2014
Firstpage
2397
Lastpage
2403
Abstract
In this paper we compare Differential Evolution (DE), an evolutionary algorithm, to classical bandit algorithms over the non-stationary bandit problem. First we define a testcase where the variation of the distributions depends on the number of times an option is evaluated rather than over time. This definition allows the possibility to apply these algorithms over a wide range of problems such as black-box portfolio selection. Second we present our own variant of discounted Upper Confidence Bound (UCB) algorithm that outperforms the current state-of-the-art algorithms for the non-stationary bandit problem. Third, we introduce a variant of DE and show that, on a selection over a portfolio of solvers for the Cart-Pole problem, our version of DE outperforms the current best UCB algorithms.
Keywords
evolutionary computation; DE; UCB algorithm; black-box portfolio selection; cart-pole problem; differential evolution algorithm; nonstationary bandit problem; upper confidence bound algorithm; Evolutionary computation; Noise measurement; Optimization; Portfolios; Sociology; Statistics; Tuning;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation (CEC), 2014 IEEE Congress on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6626-4
Type
conf
DOI
10.1109/CEC.2014.6900505
Filename
6900505
Link To Document