• DocumentCode
    2251402
  • Title

    Information state for Markov decision processes with network delays

  • Author

    Adlakha, Sachin ; Lall, Sanjay ; Goldsmith, Andrea

  • Author_Institution
    Dept. of Electr. Eng., Stanford Univ., Stanford, CA, USA
  • fYear
    2008
  • fDate
    9-11 Dec. 2008
  • Firstpage
    3840
  • Lastpage
    3847
  • Abstract
    We consider a networked control system, where each subsystem evolves as a Markov decision process (MDP). Each subsystem is coupled to its neighbors via communication links over which the signals are delayed, but are otherwise transmitted noise-free. A controller receives delayed state information from each subsystem. Such a networked Markov decision process with delays can be represented as a partially observed Markov decision process (POMDP). We show that this POMDP has a sufficient information state that depends only on a finite history of measurements and control actions. Thus, the POMDP can be converted into an information state MDP, whose state does not grow with time. The optimal controller for networked Markov decision processes can thus be computed using dynamic programming over a finite state space. This result generalizes the previous results on Markov decision processes with delayed state information.
  • Keywords
    Markov processes; delays; distributed control; optimal control; information state; network delays; networked control system; optimal controller; partially observed Markov decision process; Centralized control; Communication system control; Computer networks; Control systems; Delay systems; Dynamic programming; History; Networked control systems; Optimal control; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2008. CDC 2008. 47th IEEE Conference on
  • Conference_Location
    Cancun
  • ISSN
    0191-2216
  • Print_ISBN
    978-1-4244-3123-6
  • Electronic_ISBN
    0191-2216
  • Type

    conf

  • DOI
    10.1109/CDC.2008.4739234
  • Filename
    4739234