مرکز منطقه ای اطلاع رساني علوم و فناوري - Distributed reinforcement learning for multiple objective optimization problems

DocumentCode :

2436783

Title :

Distributed reinforcement learning for multiple objective optimization problems

Author :

Mariano, Carlos E. ; Morales, Eduardo F.

Author_Institution :

Inst. Mexicano de Tecnologia del Agua, Morelos, Mexico

Volume :

fYear :

2000

fDate :

2000

Firstpage :

188

Abstract :

This paper describes the application and performance evaluation of a new algorithm for multiple objective optimization problems (MOOP) based on reinforcement learning. The new algorithm, called MDQL, considers a family of agents for each objective function involved in a MOOP. Each agent proposes a solution for its corresponding objective function. Agents leave traces while they construct solutions considering traces made by other agents. The solutions proposed by the agents are evaluated using a non-domination criterion and solutions in the final Pareto set for each iteration are rewarded. A mechanism for the application of MDQL in continuous spaces which considers a fixed set of possible actions for the states (the number of actions depends on the dimensionality of the MOOP), is also proposed. Each action represents a path direction and its magnitude is changed dynamically depending on the evaluation of the state that the agent reached. Constraint handling, based on reinforcement comparison, considers reference values for constraints, penalizing agents violating any of them proportionally to the violation committed. MDQL performance was measured with “error ratio” and “spacing” metrics on four test bed problems suggested in the literature, showing competitive results with state-of-the-art algorithms

Keywords :

constraint handling; learning (artificial intelligence); multi-agent systems; optimisation; software performance evaluation; MDQL; Pareto set; constraint handling; distributed reinforcement learning; error ratio; multiple objective optimization problems; nondomination criterion; performance evaluation; software agents; spacing metrics; Autonomous agents; Evolutionary computation; Learning; Optimization methods; Pareto optimization; State estimation; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Evolutionary Computation, 2000. Proceedings of the 2000 Congress on

Conference_Location :

La Jolla, CA

Print_ISBN :

0-7803-6375-2

Type :

conf

DOI :

10.1109/CEC.2000.870294

Filename :

870294

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2436783