• DocumentCode
    2436783
  • Title

    Distributed reinforcement learning for multiple objective optimization problems

  • Author

    Mariano, Carlos E. ; Morales, Eduardo F.

  • Author_Institution
    Inst. Mexicano de Tecnologia del Agua, Morelos, Mexico
  • Volume
    1
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    188
  • Abstract
    This paper describes the application and performance evaluation of a new algorithm for multiple objective optimization problems (MOOP) based on reinforcement learning. The new algorithm, called MDQL, considers a family of agents for each objective function involved in a MOOP. Each agent proposes a solution for its corresponding objective function. Agents leave traces while they construct solutions considering traces made by other agents. The solutions proposed by the agents are evaluated using a non-domination criterion and solutions in the final Pareto set for each iteration are rewarded. A mechanism for the application of MDQL in continuous spaces which considers a fixed set of possible actions for the states (the number of actions depends on the dimensionality of the MOOP), is also proposed. Each action represents a path direction and its magnitude is changed dynamically depending on the evaluation of the state that the agent reached. Constraint handling, based on reinforcement comparison, considers reference values for constraints, penalizing agents violating any of them proportionally to the violation committed. MDQL performance was measured with “error ratio” and “spacing” metrics on four test bed problems suggested in the literature, showing competitive results with state-of-the-art algorithms
  • Keywords
    constraint handling; learning (artificial intelligence); multi-agent systems; optimisation; software performance evaluation; MDQL; Pareto set; constraint handling; distributed reinforcement learning; error ratio; multiple objective optimization problems; nondomination criterion; performance evaluation; software agents; spacing metrics; Autonomous agents; Evolutionary computation; Learning; Optimization methods; Pareto optimization; State estimation; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2000. Proceedings of the 2000 Congress on
  • Conference_Location
    La Jolla, CA
  • Print_ISBN
    0-7803-6375-2
  • Type

    conf

  • DOI
    10.1109/CEC.2000.870294
  • Filename
    870294