• DocumentCode
    2717686
  • Title

    Coordinated Reinforcement Learning for Decentralized Optimal Control

  • Author

    Yagan, Daniel ; Tham, Chen-Khong

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    296
  • Lastpage
    302
  • Abstract
    We consider a multi-agent system where the overall performance is affected by the joint actions or policies of agents. However, each agent only observes a partial view of the global state condition. This model is known as a decentralized partially-observable Markov decision process (DEC-POMDP), which can be considered more applicable in real-world applications such as communication networks. It is known that the exact solution to a DEC-POMDP is NEXP-complete and memory requirements grow exponentially even for finite-horizon problems. In this paper, we propose to address these issues by using an online model-free technique and by exploiting the locality of interaction among agents in order to approximate the joint optimal policy. Simulation results show the effectiveness and convergence of the proposed algorithm in the context of resource allocation for multiagent wireless multi-hop networks.
  • Keywords
    decentralised control; learning (artificial intelligence); multi-agent systems; optimal control; coordinated reinforcement learning; decentralized optimal control; decentralized partially-observable Markov decision process; finite-horizon problem; global state condition; joint optimal policy; multiagent system; multiagent wireless multihop network; online model-free technique; resource allocation; Communication networks; Context modeling; Control systems; Dynamic programming; Learning; Multiagent systems; Optimal control; Resource management; Spread spectrum communication; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0706-0
  • Type

    conf

  • DOI
    10.1109/ADPRL.2007.368202
  • Filename
    4220847