Title :
Multi-objective optimisation by reinforcement learning
Author :
Liao, H.L. ; Wu, Q.H.
Author_Institution :
Dept. of Electr. Eng. & Electron., Univ. of Liverpool, Liverpool, UK
Abstract :
This paper presents a multi-objective optimisation method based on reinforcement learning, called MORL, for solving complex multi-objective optimisation problems, in particular those in high-dimensional spaces. In MORL, the search is undertaken on individual dimensions of the high-dimensional space via a path selected according to its estimated path value. Path values, estimated by weighting the state values on the selected path, represent the potential for finding a better solution by searching along that path, and are used to memorise the quality of previously visited states. Visited states are assigned different immediate rewards by comparing the objective vector of the current state with those of the Pareto optimal solutions found previously. These Pareto optimal solutions are stored in an elite list, which keeps track of the non-dominated solutions found so far and is used to construct the Pareto front at the end of the optimisation process. MORL is compared with a promising multi-objective evolutionary algorithm based on decomposition (MOEA/D) on four widely used benchmark functions. The simulation results demonstrate that MORL is superior to MOEA/D with respect to the accuracy and the range of the Pareto fronts, especially in solving high-dimensional multi-objective optimisation problems.
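The elite list mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `dominates` and `update_elite`, the minimisation convention, and the reward values 1.0 and 0.0 are all assumptions made for illustration, since the abstract states only that rewards depend on a comparison with the stored Pareto optimal solutions.

```python
from typing import List, Sequence, Tuple

Vector = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if objective vector a Pareto-dominates b (minimisation assumed)."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def update_elite(elite: List[Vector], candidate: Vector) -> Tuple[List[Vector], float]:
    """Return the updated elite list and an immediate reward for the candidate.

    The reward values (1.0 / 0.0) are hypothetical placeholders, not taken
    from the paper.
    """
    # A duplicate or dominated candidate leaves the list unchanged.
    if candidate in elite or any(dominates(e, candidate) for e in elite):
        return elite, 0.0
    # Otherwise drop every stored solution the candidate dominates, then add it.
    kept = [e for e in elite if not dominates(candidate, e)]
    return kept + [candidate], 1.0
```

Under these assumptions, the list always holds mutually non-dominated objective vectors, so it can be read out directly as an approximation of the Pareto front when the search ends.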
Keywords :
Pareto optimisation; evolutionary computation; learning (artificial intelligence); Pareto front; Pareto optimal solution; multiobjective evolutionary algorithm; multiobjective optimisation; objective vector; path value estimation; reinforcement learning; Animals; Benchmark testing; Evolutionary computation; Learning; Optimization; Search problems; Temperature distribution
Conference_Title :
Evolutionary Computation (CEC), 2010 IEEE Congress on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-6909-3
DOI :
10.1109/CEC.2010.5585972