DocumentCode
1798025
Title
Policy gradient approaches for multi-objective sequential decision making
Author
Parisi, Simone ; Pirotta, Matteo ; Smacchia, Nicola ; Bascetta, Luca ; Restelli, Marcello
Author_Institution
Dept. of Electron., Inf. & Bioeng., Politec. di Milano, Milan, Italy
fYear
2014
fDate
6-11 July 2014
Firstpage
2323
Lastpage
2330
Abstract
This paper investigates the use of policy gradient techniques to approximate the Pareto frontier in Multi-Objective Markov Decision Processes (MOMDPs). Despite the popularity of policy gradient algorithms and the fact that gradient ascent algorithms have been already proposed to numerically solve multi-objective optimization problems, especially in combination with multi-objective evolutionary algorithms, so far little attention has been paid to the use of gradient information to face multi-objective sequential decision problems. Two different Multi-Objective Reinforcement-Learning (MORL) approaches, called radial and Pareto following, that, starting from an initial policy, perform gradient-based policy-search procedures aimed at finding a set of non-dominated policies are here presented. Both algorithms are empirically evaluated and compared to state-of-the-art MORL algorithms on three MORL benchmark problems.
Keywords
Markov processes; Pareto optimisation; approximation theory; decision making; evolutionary computation; gradient methods; learning (artificial intelligence); MOMDP; MORL approach; MORL benchmark problems; Pareto following; Pareto frontier approximation; gradient ascent algorithm; gradient-based policy-search procedure; multiobjective Markov decision process; multiobjective evolutionary algorithm; multiobjective optimization problems; multiobjective reinforcement-learning approach; multiobjective sequential decision making; nondominated policies; policy gradient approach; radial following; Approximation algorithms; Approximation methods; Covariance matrices; Optimization; Search problems; Vectors; Water resources;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6627-1
Type
conf
DOI
10.1109/IJCNN.2014.6889738
Filename
6889738
Link To Document