• DocumentCode
    1428772
  • Title

    Multiagent reinforcement learning using function approximation

  • Author

    Abul, Osman ; Polat, Faruk ; Alhajj, Reda

  • Author_Institution
    Dept. of Comput. Eng., Middle East Tech. Univ., Ankara, Turkey
  • Volume
    30
  • Issue
    4
  • fYear
    2000
  • fDate
    11/1/2000 12:00:00 AM
  • Firstpage
    485
  • Lastpage
    497
  • Abstract
    Learning in a partially observable and nonstationary environment is still one of the challenging problems in the area of multiagent (MA) learning. Reinforcement learning is a generic method that suits the needs of MA learning in many aspects. This paper presents two new multiagent based domain independent coordination mechanisms for reinforcement learning; multiple agents do not require explicit communication among themselves to learn coordinated behavior. The first coordination mechanism is the perceptual coordination mechanism, where other agents are included in state descriptions and coordination information is learned from state transitions. The second is the observing coordination mechanism, which also includes other agents in state descriptions and additionally the rewards of nearby agents are observed from the environment. The observed rewards and agent´s own reward are used to construct an optimal policy. This way, the latter mechanism tends to increase region-wide joint rewards. The selected experimented domain is adversarial food-collecting world (AFCW), which can be configured both as single and multiagent environments. Function approximation and generalization techniques are used because of the huge state space. Experimental results show the effectiveness of these mechanisms
  • Keywords
    function approximation; generalisation (artificial intelligence); learning (artificial intelligence); multi-agent systems; adversarial food-collecting world; agent communication; domain independent coordination; experimental results; function approximation; generalization; multiagent reinforcement learning; nonstationary environment; observing coordination mechanism; optimal policy; partially observable environment; perceptual coordination mechanism; region-wide joint rewards; state transitions; Artificial intelligence; Control systems; Function approximation; Intelligent robots; Learning; Medical control systems; Multiagent systems; Phase change materials; State-space methods; Table lookup;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1094-6977
  • Type

    jour

  • DOI
    10.1109/5326.897075
  • Filename
    897075