• DocumentCode
    658708
  • Title

    Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-agent Learning

  • Author

    Xiangbin Zhu ; Chongjie Zhang ; Lesser, Victor

  • Volume
    2
  • fYear
    2013
  • fDate
    17-20 Nov. 2013
  • Firstpage
    321
  • Lastpage
    328
  • Abstract
    Coordinating multi-agent reinforcement learning provides a promising approach to scaling learning in large cooperative multi-agent systems. It allows agents to learn local decision policies based on their local observations and rewards, and, meanwhile, coordinates agents´ learning processes to ensure the global learning performance. One key question is that how coordination mechanisms impact learning algorithms so that agents´ learning processes are guided and coordinated. This paper presents a new shaping approach that effectively integrates coordination mechanisms into local learning processes. This shaping approach uses two-level agent organization structures and combines reward shaping and action shaping. The higher-level agents dynamically and periodically produce the shaping heuristic knowledge based on the learning status of the lower-level agents. The lower-level agents then uses this knowledge to coordinate their local learning processes with other agents. Experimental results show our approach effectively speeds up the convergence of multi-agent learning in large systems.
  • Keywords
    decision making; learning (artificial intelligence); multi-agent systems; agent learning processes; cooperative multiagent systems; dynamic action shaping; dynamic reward shaping; global learning performance; heuristic knowledge shaping; learning scaling approach; local decision policies; local observations; local rewards; lower-level agents; multiagent reinforcement learning coordination; two-level agent organization structures; Educational institutions; Equations; Learning (artificial intelligence); Mathematical model; Multi-agent systems; Organizations; Supervisory control; Action Shaping; Multi-Agent Learning; Organization Control; Reward Shaping; Supervision;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4799-2902-3
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2013.127
  • Filename
    6690807