DocumentCode :
1797502
Title :
Scalarization based Pareto optimal set of arms identification algorithms
Author :
Drugan, Madalina M. ; Nowe, Ann
Author_Institution :
Artificial Intell. Lab. of Comput. Sci. Dept., Vrije Univ. Brussel, Brüssel, Belgium
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
2690
Lastpage :
2697
Abstract :
Multi-objective multi-armed bandits (MOMAB) is an extension of the multi-objective multi-armed bandits framework that considers reward vectors instead of scalar reward values. Scalarization functions transform the reward vectors into reward values in order to use the standard multi-armed bandits (MAB) algorithms. However for many applications it is not obvious to come up with a good scalarization set and therefore there is needed to develop MAB that discover the whole Pareto set of arms. Our approach to this multi-objective MAB problem is two folded: i) identify the set of Pareto optimal arms and ii) identify the minimum subset of scalarization functions that optimize the set of Pareto optimal arms. We experimentally compare the proposed MOMAB algorithms on a multi-objective Bernoulli problem.
Keywords :
Pareto analysis; learning (artificial intelligence); MOMAB algorithms; Pareto optimal arms set; arms identification algorithms; machine learning paradigm; multiarmed bandits; multiobjective Bernoulli problem; multiobjective MAB problem; reward vectors; scalarization functions; Algorithm design and analysis; Chebyshev approximation; Equations; Pareto optimization; Transforms; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889484
Filename :
6889484
Link To Document :
بازگشت