• DocumentCode
    554112
  • Title

    Cooperative multi-agent reinforcement learning based on online heuristic extraction

  • Author

    Jun Wu ; Xin Xu ; Zhen-ping Sun ; Yan Huang

  • Author_Institution
    Coll. of Mechatron. & Autom., Nat. Univ. of Defense Technol., Changsha, China
  • Volume
    3
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1299
  • Lastpage
    1303
  • Abstract
    Reinforcement learning has been an important technique for adaptive decision-making of multi-agent systems in uncertain environments. However, the curse of dimensionality in multi-agent reinforcement learning usually causes the slow learning convergence or even failure. In this paper, a novel Online Heuristics Extraction method, which can integrate the prior heuristic policy with a learned heuristic policy, is presented. The new method can be incorporated into a tabular or approximate cooperative multi-agent reinforcement learning algorithm so as to speed up the learning process. Simulation results on a cooperative learning task show that, with the new method, a much better learning convergence can be achieved.
  • Keywords
    iterative methods; learning (artificial intelligence); multi-agent systems; adaptive decision making; cooperative multiagent reinforcement learning algorithm; heuristic policy; online heuristic extraction method; Approximation algorithms; Convergence; Heuristic algorithms; Joints; Learning; Learning systems; Robots; cooperative; heuristic policy; multi-agent; policy iteration; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2011 Seventh International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    2157-9555
  • Print_ISBN
    978-1-4244-9950-2
  • Type

    conf

  • DOI
    10.1109/ICNC.2011.6022301
  • Filename
    6022301