• DocumentCode
    2863169
  • Title

    Reinforcement learning in swarms that learn

  • Author

    Peters, James F. ; Henry, Christopher ; Ramanna, Sheela

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Manitoba Univ., Winnipeg, Man., Canada
  • fYear
    2005
  • fDate
    19-22 Sept. 2005
  • Firstpage
    400
  • Lastpage
    406
  • Abstract
    This paper introduces an approach to reinforcement learning by cooperating agents using a variation of the actor critic method. This is made possible by considering behavior patterns of swarms in the context of approximation spaces. Rough set theory introduced by Zdzislaw Pawlak in 1982 provides a ground for deriving pattern-based rewards within approximation spaces. The framework provided by an approximation space makes it possible to derive pattern-based reference rewards used to estimate action preferences. Approximation spaces are used to derive action-based reference rewards at the swarm intelligence level. Two different forms of the actor critic reinforcement learning method are considered as a part of a study of learning in real-time by a swarm. The contribution of this article is the presentation of a new actor critic method defined in the context of approximation spaces. An ecosystem designed to facilitate study of reinforcement learning by swarms is briefly described. In addition, the results of ecosystem experiments for two forums of the actor critic method are given.
  • Keywords
    learning (artificial intelligence); multi-agent systems; rough set theory; actor critic method; approximation space; cooperating agent; ecosystem; reference reward; reinforcement learning; rough set theory; swarm behavior pattern; Computer science; Ecosystems; Extraterrestrial measurements; Learning; Multiagent systems; Particle swarm optimization; Rough sets; Set theory; Space technology; State estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Agent Technology, IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2416-8
  • Type

    conf

  • DOI
    10.1109/IAT.2005.145
  • Filename
    1565572