DocumentCode
3510427
Title
Multi-Agent Cooperation by Q-Learning in Continuous Action Domain
Author
Hwang, Kao-Shing ; Lin, Yu-Hong ; Lo, Chia-Yue
Author_Institution
Electr. Eng., Nat. Chung Cheng Univ., Ming-Hsiung
fYear
2008
fDate
1-3 Nov. 2008
Firstpage
111
Lastpage
114
Abstract
In this paper we propose Q-learning with continuous action space and extend this algorithm to a multi-agent system. Conventional Q-learning needs a pre-defined and discrete state space. But it is not practical because the states of the environment in the real world and actions are both continuous. The algorithm will use a concept that is similar to the SRV (Stochastic Real-Valued Unit) to train the actions in each state. The convergence of the SRV may fall into local solution even if it has never reached the optimal solution. In order to overcome this drawback, the Q-learning with SRRV (Stochastic Recording Real-Valued unit) is proposed, and it shows that the SRRV will converge more quickly.
Keywords
learning (artificial intelligence); multi-agent systems; state-space methods; stochastic processes; Q-learning; continuous action domain; continuous action space; discrete state space; multiagent cooperation; multiagent system; predefined state space; stochastic real-valued unit; stochastic recording real-valued unit; Intelligent networks; Intelligent systems; Least squares approximation; Machine learning; Machine learning algorithms; Multiagent systems; Robots; Space exploration; State-space methods; Stochastic processes; Q-learning; Stochastic Real-Valued;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Networks and Intelligent Systems, 2008. ICINIS '08. First International Conference on
Conference_Location
Wuhan
Print_ISBN
978-0-7695-3391-9
Electronic_ISBN
978-0-7695-3391-9
Type
conf
DOI
10.1109/ICINIS.2008.160
Filename
4683180
Link To Document