• DocumentCode
    2858599
  • Title

    A reinforcement learning approach to cooperative problem solving

  • Author

    Yoshida, Tetsuya ; Hori, Koichi ; Nakasuka, Shinichi

  • Author_Institution
    Graduate Sch. of Eng. Sci., Osaka Univ., Japan
  • fYear
    1998
  • fDate
    3-7 Jul 1998
  • Firstpage
    479
  • Lastpage
    480
  • Abstract
    We propose an extension of reinforcement learning methods to cooperative problem solving in multi agent systems. Exploiting multiple agents for complex problems is promising, however, learning is necessary since complete domain knowledge is rarely available. The temporal difference algorithm is applied in each agent to learn a heuristic evaluation of states. Besides the reward for solutions produced by agents, we define the reward for coherence as a whole and exploit them to facilitate cooperation among agents for global problem solving. We evaluate the method by experiments on the satellite design problem. The result shows that our method enables agents to learn to cooperate as well as to learn individual heuristics within one framework. Especially, agents themselves learn to take the appropriate balance between exploration and exploitation in problem solving, which is known to greatly affect the performance. It also suggests the possibility of controlling the global behavior of multi agent systems via rewards in reinforcement learning.
  • Keywords
    learning (artificial intelligence); cooperative problem solving; domain knowledge; global behavior; global problem solving; heuristic evaluation; multi agent systems; reinforcement learning approach; reinforcement learning methods; reward; satellite design problem; temporal difference algorithm; Autonomous agents; Control systems; Distributed computing; Learning; Multiagent systems; Problem-solving; Quality management; Resource management; Satellites; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multi Agent Systems, 1998. Proceedings. International Conference on
  • Print_ISBN
    0-8186-8500-X
  • Type

    conf

  • DOI
    10.1109/ICMAS.1998.699295
  • Filename
    699295