• DocumentCode
    955264
  • Title

    Self-Organizing Neural Architectures and Cooperative Learning in a Multiagent Environment

  • Author

    Xiao, Dan ; Tan, Ah-Hwee

  • Author_Institution
    Nanyang Technol. Univ., Singapore
  • Volume
    37
  • Issue
    6
  • fYear
    2007
  • Firstpage
    1567
  • Lastpage
    1580
  • Abstract
    Temporal-difference-fusion architecture for learning, cognition, and navigation (TD-FALCON) is a generalization of adaptive resonance theory (a class of self-organizing neural networks) that incorporates TD methods for real-time reinforcement learning. In this paper, we investigate how a team of TD-FALCON networks may cooperate to learn and function in a dynamic multiagent environment based on minefield navigation and a predator/prey pursuit tasks. Experiments on the navigation task demonstrate that TD-FALCON agent teams are able to adapt and function well in a multiagent environment without an explicit mechanism of collaboration. In comparison, traditional Q-learning agents using gradient-descent-based feedforward neural networks, trained with the standard backpropagation and the resilient-propagation (RPROP) algorithms, produce a significantly poorer level of performance. For the predator/prey pursuit task, we experiment with various cooperative strategies and find that a combination of a high-level compressed state representation and a hybrid reward function produces the best results. Using the same cooperative strategy, the TD-FALCON team also outperforms the RPROP-based reinforcement learners in terms of both task completion rate and learning efficiency.
  • Keywords
    learning (artificial intelligence); mining; multi-agent systems; navigation; predator-prey systems; self-organising feature maps; TD-FALCON agent teams; adaptive resonance theory; cooperative learning; minefield navigation; multiagent environment; predator/prey pursuit tasks; real-time reinforcement learning; self-organizing neural architectures; temporal-difference-fusion architecture learning cognition and navigation; Multiagent cooperative learning; reinforcement learning (RL); self-organizing neural architectures; Algorithms; Computer Simulation; Decision Support Techniques; Models, Theoretical; Neural Networks (Computer); Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2007.907040
  • Filename
    4362697