Best response learning based on Gaussian regression for multi-agent systems in continuous spaces

Author

Haijun, Wei ; Xin, Chen ; Min, Wu ; Weihua, Cao

Author_Institution

Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China

fYear

2012

fDate

25-27 July 2012

Firstpage

6196

Lastpage

6201

Abstract

In the implementations of multi-agent systems, generalization is always viewed as one of the key issues before multi-agent reinforcement learning algorithms are applicable to continuous environments. The paper proposes a best response learning based on Gaussian regression for multi-agent systems in continuous spaces. With a new Q value with reduced dimension defined, the algorithm entitles agent to learning strategy adapting to others´ behaviors. To realize generalization, probabilistic model of state transition in the algorithm is constructed by using Gaussian regression, so that dynamic programming can be applied directly to generate the best response strategy. And both Q-function model and V-function model are built real time in order to generalize state and action spaces. Thus the learning agent is able to tracking partners´ strategies. In the simulation of Double-cart-pole, which is a typical coordinated control problem, even if dynamics is unknown in priori, the algorithm enables agent to learn coordinated strategy, and realize generalization of state space as well.

Keywords

Gaussian processes; learning (artificial intelligence); multi-agent systems; regression analysis; Gaussian regression; Q value; Q-function model; V-function model; best response learning; continuous spaces; coordinated control problem; double-cart-pole; multiagent reinforcement learning algorithms; multiagent systems; reduced dimension; Decision support systems; Continues Spaces; Gaussian Regression; Multi-agent systems; Reinforcement learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Control Conference (CCC), 2012 31st Chinese

Conference_Location

Hefei

ISSN

1934-1768

Print_ISBN

978-1-4673-2581-3

Type

conf

Filename

6391027