Multi-Agent Cooperation by Q-Learning in Continuous Action Domain

Author

Hwang, Kao-Shing ; Lin, Yu-Hong ; Lo, Chia-Yue

Author_Institution

Electr. Eng., Nat. Chung Cheng Univ., Ming-Hsiung

fYear

2008

fDate

1-3 Nov. 2008

Firstpage

111

Lastpage

114

Abstract

In this paper we propose Q-learning with continuous action space and extend this algorithm to a multi-agent system. Conventional Q-learning needs a pre-defined and discrete state space. But it is not practical because the states of the environment in the real world and actions are both continuous. The algorithm will use a concept that is similar to the SRV (Stochastic Real-Valued Unit) to train the actions in each state. The convergence of the SRV may fall into local solution even if it has never reached the optimal solution. In order to overcome this drawback, the Q-learning with SRRV (Stochastic Recording Real-Valued unit) is proposed, and it shows that the SRRV will converge more quickly.

Keywords

learning (artificial intelligence); multi-agent systems; state-space methods; stochastic processes; Q-learning; continuous action domain; continuous action space; discrete state space; multiagent cooperation; multiagent system; predefined state space; stochastic real-valued unit; stochastic recording real-valued unit; Intelligent networks; Intelligent systems; Least squares approximation; Machine learning; Machine learning algorithms; Multiagent systems; Robots; Space exploration; State-space methods; Stochastic processes; Q-learning; Stochastic Real-Valued;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Networks and Intelligent Systems, 2008. ICINIS '08. First International Conference on

Conference_Location

Wuhan

Print_ISBN

978-0-7695-3391-9

Electronic_ISBN

978-0-7695-3391-9

Type

conf

DOI

10.1109/ICINIS.2008.160

Filename

4683180