DocumentCode
582608
Title
Best response learning based on Gaussian regression for multi-agent systems in continuous spaces
Author
Haijun, Wei ; Xin, Chen ; Min, Wu ; Weihua, Cao
Author_Institution
Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
fYear
2012
fDate
25-27 July 2012
Firstpage
6196
Lastpage
6201
Abstract
In the implementations of multi-agent systems, generalization is always viewed as one of the key issues before multi-agent reinforcement learning algorithms are applicable to continuous environments. The paper proposes a best response learning based on Gaussian regression for multi-agent systems in continuous spaces. With a new Q value with reduced dimension defined, the algorithm entitles agent to learning strategy adapting to others´ behaviors. To realize generalization, probabilistic model of state transition in the algorithm is constructed by using Gaussian regression, so that dynamic programming can be applied directly to generate the best response strategy. And both Q-function model and V-function model are built real time in order to generalize state and action spaces. Thus the learning agent is able to tracking partners´ strategies. In the simulation of Double-cart-pole, which is a typical coordinated control problem, even if dynamics is unknown in priori, the algorithm enables agent to learn coordinated strategy, and realize generalization of state space as well.
Keywords
Gaussian processes; learning (artificial intelligence); multi-agent systems; regression analysis; Gaussian regression; Q value; Q-function model; V-function model; best response learning; continuous spaces; coordinated control problem; double-cart-pole; multiagent reinforcement learning algorithms; multiagent systems; reduced dimension; Decision support systems; Continues Spaces; Gaussian Regression; Multi-agent systems; Reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Conference (CCC), 2012 31st Chinese
Conference_Location
Hefei
ISSN
1934-1768
Print_ISBN
978-1-4673-2581-3
Type
conf
Filename
6391027
Link To Document