Title :
Multi-objective fitted Q-iteration: Pareto frontier approximation in one single run
Author :
Castelletti, Andrea ; Pianosi, Francesca ; Restelli, Marcello
Author_Institution :
Dipt. di Elettron. e Inf., Politec. di Milano, Milan, Italy
Abstract :
We present a novel batch-mode Reinforcement Learning approach for the design of optimal controllers in the presence of multiple objectives. The algorithm is an extension of Fitted Q-iteration (FQI) that enables to design the controller for all the linear combinations of preferences (weights) assigned to the objectives in a single run. The key idea of multi-objective FQI (MOFQI) is to enlarge the continuous approximation of the value function, which is performed by single-objective FQI over the state-control space, also to the weight space. The bacth-mode nature of the algorithm makes it possible the enrichment of the learning data with nearly no additional computational cost with respect to a single-objective formulation on the same system. The approach was tested on a simple test case study concerning the optimal operation of a two-objective water reservoir, where MOFQI algorithm proved to be computationally preferable over repeatedly running FQI for different weight values when more than five points on the Pareto frontier are considered.
Keywords :
control system synthesis; function approximation; learning (artificial intelligence); optimal control; reservoirs; MOFQI algorithm; Pareto frontier approximation; batch-mode reinforcement learning approach; multiobjective fitted Q-iteration; optimal controller design; state-control space; value function continuous approximation; water reservoir; weight space; Aerospace electronics; Algorithm design and analysis; Approximation algorithms; Approximation methods; Optimization; Reservoirs;
Conference_Titel :
Networking, Sensing and Control (ICNSC), 2011 IEEE International Conference on
Conference_Location :
Delft
Print_ISBN :
978-1-4244-9570-2
DOI :
10.1109/ICNSC.2011.5874921