• DocumentCode
    2358933
  • Title

    Multi-agent Multi-objective Learning Using Heuristically Accelerated Reinforcement Learning

  • Author

    Ferreira, Leonardo A. ; Bianchi, Reinaldo A C ; Ribeiro, Carlos H C

  • Author_Institution
    Centro Univ. da FEI, Sao Bernardo do Campo, Brazil
  • fYear
    2012
  • fDate
    16-19 Oct. 2012
  • Firstpage
    14
  • Lastpage
    20
  • Abstract
    This paper introduces two new algorithms aimed at solving multi-agent multi-objective reinforcement learning problems in which the learning agent must not only interact with multiples agents but also consider various objectives (or criteria) in order to solve the problem. The main concept behind the proposed algorithms is a modular approach that is used to divide the multiple objectives in modules, and making each one of these modules learn a different objective with different Action-Value and reinforcement functions. Besides the decomposition of objectives, both algorithms use a heuristic function to accelerate the learning process. The first algorithm learns one objective at a time, iterating along the objectives, while the second proposed algorithm also divides the problem in sub-problems but learns every objective simultaneously. The Predator-Prey problem was chosen to compare the performance of both proposed solutions with well known algorithms. In this problem, the learning agent plays the role of the prey and must learn to find food in a fixed position of a grid world while being pursued by the predator. The considered objectives are finding food and avoiding the predator. As the results shows, decomposing a multi-objective problem in sub-problems and using heuristics makes the learning process faster and easier to implement. We notice that the first algorithm introduced in this paper learns faster, but it is more difficult to implement in a real world environment.
  • Keywords
    learning (artificial intelligence); multi-agent systems; predator-prey systems; action-value; heuristic function; modular approach; multi-agent multiobjective learning; multiple objectives; multiples agents; predator-prey problem; reinforcement learning; Accelerated aging; Acceleration; Convergence; Heuristic algorithms; Learning; Learning systems; Vectors; Artificial Intelligence; Machine Learning; Multiagent systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics Symposium and Latin American Robotics Symposium (SBR-LARS), 2012 Brazilian
  • Conference_Location
    Fortaleza
  • Print_ISBN
    978-1-4673-4650-4
  • Type

    conf

  • DOI
    10.1109/SBR-LARS.2012.10
  • Filename
    6363312