Title :
Scalarization based Pareto optimal set of arms identification algorithms
Author :
Drugan, Madalina M. ; Nowe, Ann
Author_Institution :
Artificial Intell. Lab. of Comput. Sci. Dept., Vrije Univ. Brussel, Brüssel, Belgium
Abstract :
Multi-objective multi-armed bandits (MOMAB) is an extension of the multi-objective multi-armed bandits framework that considers reward vectors instead of scalar reward values. Scalarization functions transform the reward vectors into reward values in order to use the standard multi-armed bandits (MAB) algorithms. However for many applications it is not obvious to come up with a good scalarization set and therefore there is needed to develop MAB that discover the whole Pareto set of arms. Our approach to this multi-objective MAB problem is two folded: i) identify the set of Pareto optimal arms and ii) identify the minimum subset of scalarization functions that optimize the set of Pareto optimal arms. We experimentally compare the proposed MOMAB algorithms on a multi-objective Bernoulli problem.
Keywords :
Pareto analysis; learning (artificial intelligence); MOMAB algorithms; Pareto optimal arms set; arms identification algorithms; machine learning paradigm; multiarmed bandits; multiobjective Bernoulli problem; multiobjective MAB problem; reward vectors; scalarization functions; Algorithm design and analysis; Chebyshev approximation; Equations; Pareto optimization; Transforms; Vectors;
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
DOI :
10.1109/IJCNN.2014.6889484