• DocumentCode
    1798025
  • Title

    Policy gradient approaches for multi-objective sequential decision making

  • Author

    Parisi, Simone ; Pirotta, Matteo ; Smacchia, Nicola ; Bascetta, Luca ; Restelli, Marcello

  • Author_Institution
    Dept. of Electron., Inf. & Bioeng., Politec. di Milano, Milan, Italy
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    2323
  • Lastpage
    2330
  • Abstract
    This paper investigates the use of policy gradient techniques to approximate the Pareto frontier in Multi-Objective Markov Decision Processes (MOMDPs). Despite the popularity of policy gradient algorithms and the fact that gradient ascent algorithms have been already proposed to numerically solve multi-objective optimization problems, especially in combination with multi-objective evolutionary algorithms, so far little attention has been paid to the use of gradient information to face multi-objective sequential decision problems. Two different Multi-Objective Reinforcement-Learning (MORL) approaches, called radial and Pareto following, that, starting from an initial policy, perform gradient-based policy-search procedures aimed at finding a set of non-dominated policies are here presented. Both algorithms are empirically evaluated and compared to state-of-the-art MORL algorithms on three MORL benchmark problems.
  • Keywords
    Markov processes; Pareto optimisation; approximation theory; decision making; evolutionary computation; gradient methods; learning (artificial intelligence); MOMDP; MORL approach; MORL benchmark problems; Pareto following; Pareto frontier approximation; gradient ascent algorithm; gradient-based policy-search procedure; multiobjective Markov decision process; multiobjective evolutionary algorithm; multiobjective optimization problems; multiobjective reinforcement-learning approach; multiobjective sequential decision making; nondominated policies; policy gradient approach; radial following; Approximation algorithms; Approximation methods; Covariance matrices; Optimization; Search problems; Vectors; Water resources;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889738
  • Filename
    6889738