DocumentCode
554112
Title
Cooperative multi-agent reinforcement learning based on online heuristic extraction
Author
Jun Wu ; Xin Xu ; Zhen-ping Sun ; Yan Huang
Author_Institution
Coll. of Mechatron. & Autom., Nat. Univ. of Defense Technol., Changsha, China
Volume
3
fYear
2011
fDate
26-28 July 2011
Firstpage
1299
Lastpage
1303
Abstract
Reinforcement learning has been an important technique for adaptive decision-making of multi-agent systems in uncertain environments. However, the curse of dimensionality in multi-agent reinforcement learning usually causes the slow learning convergence or even failure. In this paper, a novel Online Heuristics Extraction method, which can integrate the prior heuristic policy with a learned heuristic policy, is presented. The new method can be incorporated into a tabular or approximate cooperative multi-agent reinforcement learning algorithm so as to speed up the learning process. Simulation results on a cooperative learning task show that, with the new method, a much better learning convergence can be achieved.
Keywords
iterative methods; learning (artificial intelligence); multi-agent systems; adaptive decision making; cooperative multiagent reinforcement learning algorithm; heuristic policy; online heuristic extraction method; Approximation algorithms; Convergence; Heuristic algorithms; Joints; Learning; Learning systems; Robots; cooperative; heuristic policy; multi-agent; policy iteration; reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation (ICNC), 2011 Seventh International Conference on
Conference_Location
Shanghai
ISSN
2157-9555
Print_ISBN
978-1-4244-9950-2
Type
conf
DOI
10.1109/ICNC.2011.6022301
Filename
6022301
Link To Document