DocumentCode
1798057
Title
Multi-objective χ-Armed bandits
Author
Van Moffaert, K. ; Van Vaerenbergh, Kevin ; Vrancx, Peter ; Nowe, Ann
Author_Institution
Dept. of Comput. Sci., Vrije Univ. Brussel, Brussels, Belgium
fYear
2014
fDate
6-11 July 2014
Firstpage
2331
Lastpage
2338
Abstract
Many standard optimization algorithms focus on optimizing a single, scalar feedback signal. However, real-life optimization problems often require the simultaneous optimization of more than one objective. In this paper, we propose a multi-objective extension to the standard χ-armed bandit problem. As the feedback signal is now vector-valued, the goal of the agent is to sample actions in the Pareto dominating area of the objective space. Therefore, we propose the multi-objective Hierarchical Optimistic Optimization strategy, which discretizes the continuous action space in relation to the Pareto optimal solutions obtained in the objective space. We experimentally validate the approach on two well-known multi-objective test functions and on a simulation of a real-life application, the filling phase of a wet clutch. We demonstrate that the strategy makes it possible to identify the Pareto front after just a few epochs and to sample accordingly. After learning, several multi-objective quality indicators show that the set of solutions sampled by the algorithm very closely approximates the Pareto front.
Keywords
Pareto optimisation; feedback; Pareto dominating area; Pareto front; Pareto optimal solutions; continuous action space; multiobjective χ-armed bandits; multiobjective extension; multiobjective hierarchical optimistic optimization strategy; multiobjective quality indicators; multiobjective test functions; optimization algorithms; scalar feedback signal; vector-valued feedback signal; Approximation algorithms; Pareto optimization; Pistons; Shafts; Torque; Vectors;
fLanguage
English
Publisher
IEEE
Conference_Titel
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6627-1
Type
conf
DOI
10.1109/IJCNN.2014.6889753
Filename
6889753