Cooperative multi-agent reinforcement learning based on online heuristic extraction

Author

Jun Wu ; Xin Xu ; Zhen-ping Sun ; Yan Huang

Author_Institution

Coll. of Mechatron. & Autom., Nat. Univ. of Defense Technol., Changsha, China

Volume

3

fYear

2011

fDate

26-28 July 2011

Firstpage

1299

Lastpage

1303

Abstract

Reinforcement learning has been an important technique for adaptive decision-making of multi-agent systems in uncertain environments. However, the curse of dimensionality in multi-agent reinforcement learning usually causes the slow learning convergence or even failure. In this paper, a novel Online Heuristics Extraction method, which can integrate the prior heuristic policy with a learned heuristic policy, is presented. The new method can be incorporated into a tabular or approximate cooperative multi-agent reinforcement learning algorithm so as to speed up the learning process. Simulation results on a cooperative learning task show that, with the new method, a much better learning convergence can be achieved.

Keywords

iterative methods; learning (artificial intelligence); multi-agent systems; adaptive decision making; cooperative multiagent reinforcement learning algorithm; heuristic policy; online heuristic extraction method; Approximation algorithms; Convergence; Heuristic algorithms; Joints; Learning; Learning systems; Robots; cooperative; heuristic policy; multi-agent; policy iteration; reinforcement learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Natural Computation (ICNC), 2011 Seventh International Conference on

Conference_Location

Shanghai

ISSN

2157-9555

Print_ISBN

978-1-4244-9950-2

Type

conf

DOI

10.1109/ICNC.2011.6022301

Filename

6022301