• DocumentCode
    1576236
  • Title

    Multi-objective fitted Q-iteration: Pareto frontier approximation in one single run

  • Author

    Castelletti, Andrea ; Pianosi, Francesca ; Restelli, Marcello

  • Author_Institution
    Dipt. di Elettron. e Inf., Politec. di Milano, Milan, Italy
  • fYear
    2011
  • Firstpage
    260
  • Lastpage
    265
  • Abstract
    We present a novel batch-mode Reinforcement Learning approach for the design of optimal controllers in the presence of multiple objectives. The algorithm is an extension of Fitted Q-iteration (FQI) that enables to design the controller for all the linear combinations of preferences (weights) assigned to the objectives in a single run. The key idea of multi-objective FQI (MOFQI) is to enlarge the continuous approximation of the value function, which is performed by single-objective FQI over the state-control space, also to the weight space. The bacth-mode nature of the algorithm makes it possible the enrichment of the learning data with nearly no additional computational cost with respect to a single-objective formulation on the same system. The approach was tested on a simple test case study concerning the optimal operation of a two-objective water reservoir, where MOFQI algorithm proved to be computationally preferable over repeatedly running FQI for different weight values when more than five points on the Pareto frontier are considered.
  • Keywords
    control system synthesis; function approximation; learning (artificial intelligence); optimal control; reservoirs; MOFQI algorithm; Pareto frontier approximation; batch-mode reinforcement learning approach; multiobjective fitted Q-iteration; optimal controller design; state-control space; value function continuous approximation; water reservoir; weight space; Aerospace electronics; Algorithm design and analysis; Approximation algorithms; Approximation methods; Optimization; Reservoirs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networking, Sensing and Control (ICNSC), 2011 IEEE International Conference on
  • Conference_Location
    Delft
  • Print_ISBN
    978-1-4244-9570-2
  • Type

    conf

  • DOI
    10.1109/ICNSC.2011.5874921
  • Filename
    5874921