• DocumentCode
    1580743
  • Title

    Distributed, heterogeneous, multi-agent social coordination via reinforcement learning

  • Author

    Shi, Dongqing ; Sauter, Michael Z. ; Kralik, Jerald D.

  • Author_Institution
    Dept. of Psychological & Brain Sci., Dartmouth Coll., Hanover, NH, USA
  • fYear
    2009
  • Firstpage
    653
  • Lastpage
    658
  • Abstract
    Multi-agent systems are becoming more popular in a variety of problem domains that benefit from increased parallelism, system robustness, and scalability, ranging from search and rescue to investment management. Multi-agent systems analysis studies how multiple agents coordinate with each other to maximize some team goal or individual best reward. Coordination achieved through learning provides a great advantage over modeling methods, especially when tasks become very complex and environments more dynamic. Because social primates such as chimpanzees are a highly successful multi-agent system that uses learning to adapt flexibly to changing social and environmental conditions, we are attempting to simulate their social cognition and behavior. The paper presents a foraging task to study how multiple agents can use reinforcement learning to coordinate as a group under social constraints, while also trying to maximize their own reward. Each distributed, heterogenous agent uses the WoLF-PHC algorithm, and with no communication, the agents learn to select the best foraging patch based on the behavior of others through the ¿win or learn fast¿ heuristic. The simulation results demonstrate that the agents can perform in a manner similar to the natural social behavior of chimpanzees, and show that we have a working model system for studying more complex chimpanzee social behavior in the future.
  • Keywords
    learning (artificial intelligence); multi-robot systems; robust control; WoLF-PHC algorithm; complex chimpanzee social behavior; investment management; multiagent social coordination; reinforcement learning; system robustness; win or learn fast heuristic; Biomimetics; Conference management; Convergence; Investments; Learning; Multiagent systems; Parallel robots; Robot kinematics; Robustness; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Biomimetics (ROBIO), 2009 IEEE International Conference on
  • Conference_Location
    Guilin
  • Print_ISBN
    978-1-4244-4774-9
  • Electronic_ISBN
    978-1-4244-4775-6
  • Type

    conf

  • DOI
    10.1109/ROBIO.2009.5420595
  • Filename
    5420595