Coordinated Reinforcement Learning for Decentralized Optimal Control

Author

Yagan, Daniel ; Tham, Chen-Khong

Author_Institution

Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore

fYear

2007

fDate

1-5 April 2007

Firstpage

296

Lastpage

302

Abstract

We consider a multi-agent system where the overall performance is affected by the joint actions or policies of agents. However, each agent only observes a partial view of the global state condition. This model is known as a decentralized partially-observable Markov decision process (DEC-POMDP), which can be considered more applicable in real-world applications such as communication networks. It is known that the exact solution to a DEC-POMDP is NEXP-complete and memory requirements grow exponentially even for finite-horizon problems. In this paper, we propose to address these issues by using an online model-free technique and by exploiting the locality of interaction among agents in order to approximate the joint optimal policy. Simulation results show the effectiveness and convergence of the proposed algorithm in the context of resource allocation for multiagent wireless multi-hop networks.

Keywords

decentralised control; learning (artificial intelligence); multi-agent systems; optimal control; coordinated reinforcement learning; decentralized optimal control; decentralized partially-observable Markov decision process; finite-horizon problem; global state condition; joint optimal policy; multiagent system; multiagent wireless multihop network; online model-free technique; resource allocation; Communication networks; Context modeling; Control systems; Dynamic programming; Learning; Multiagent systems; Optimal control; Resource management; Spread spectrum communication; Stochastic processes;

fLanguage

English

Publisher

ieee

Conference_Titel

Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on

Conference_Location

Honolulu, HI

Print_ISBN

1-4244-0706-0

Type

conf

DOI

10.1109/ADPRL.2007.368202

Filename

4220847