• DocumentCode
    388792
  • Title

    Achieving corporative behavior in heterogeneous agents using hierarchic reinforcement learning-an approach to piano mover´s problem

  • Author

    Ishiwaka, Yuko ; Yoshida, Tomohiro ; Yokoi, Hiroshi ; Kakazu, Yukinori

  • Author_Institution
    Hakodate Nat. Coll. of Technol., Hokkaido, Japan
  • Volume
    4
  • fYear
    2002
  • fDate
    6-9 Oct. 2002
  • Abstract
    Our approach is to achieve the cooperative behavior of autonomous decentralized agents constructed with Q-Learning, which is a type of reinforcement learning. The piano mover´s problem is employed. We propose the multi agent architecture that has an external agent and internal agents. Internal agents are heterogeneous and they can communicate with each other. The movement of the external agent depends on the composition of the actions of internal agents. According to learning its own shape by internal agents, it is expected that the agents avoid obstacles. We simulate our method on a two-dimensional continuous world. The results show the effect of our method.
  • Keywords
    learning (artificial intelligence); multi-agent systems; Q-Learning; autonomous decentralized agents; cooperative behavior; heterogeneous agents; hierarchic reinforcement learning; multi agent architecture; obstacle avoidance; piano mover problem; two-dimensional continuous world; Collision avoidance; Educational institutions; Geometry; Learning; Mobile robots; Motion planning; Multiagent systems; Path planning; Remotely operated vehicles; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2002 IEEE International Conference on
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7437-1
  • Type

    conf

  • DOI
    10.1109/ICSMC.2002.1173252
  • Filename
    1173252