DocumentCode :
2717686
Title :
Coordinated Reinforcement Learning for Decentralized Optimal Control
Author :
Yagan, Daniel ; Tham, Chen-Khong
Author_Institution :
Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore
fYear :
2007
fDate :
1-5 April 2007
Firstpage :
296
Lastpage :
302
Abstract :
We consider a multi-agent system where the overall performance is affected by the joint actions or policies of agents. However, each agent only observes a partial view of the global state condition. This model is known as a decentralized partially-observable Markov decision process (DEC-POMDP), which can be considered more applicable in real-world applications such as communication networks. It is known that the exact solution to a DEC-POMDP is NEXP-complete and memory requirements grow exponentially even for finite-horizon problems. In this paper, we propose to address these issues by using an online model-free technique and by exploiting the locality of interaction among agents in order to approximate the joint optimal policy. Simulation results show the effectiveness and convergence of the proposed algorithm in the context of resource allocation for multiagent wireless multi-hop networks.
Keywords :
decentralised control; learning (artificial intelligence); multi-agent systems; optimal control; coordinated reinforcement learning; decentralized optimal control; decentralized partially-observable Markov decision process; finite-horizon problem; global state condition; joint optimal policy; multiagent system; multiagent wireless multihop network; online model-free technique; resource allocation; Communication networks; Context modeling; Control systems; Dynamic programming; Learning; Multiagent systems; Optimal control; Resource management; Spread spectrum communication; Stochastic processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0706-0
Type :
conf
DOI :
10.1109/ADPRL.2007.368202
Filename :
4220847
Link To Document :
بازگشت