• DocumentCode
    1180220
  • Title

    Markov decision processes with delays and asynchronous cost collection

  • Author

    Katsikopoulos, Konstantinos V. ; Engelbrecht, Sascha E.

  • Author_Institution
    Dept. of Mech. & Ind. Eng., Massachusetts Univ., Amherst, MA, USA
  • Volume
    48
  • Issue
    4
  • fYear
    2003
  • fDate
    4/1/2003 12:00:00 AM
  • Firstpage
    568
  • Lastpage
    574
  • Abstract
    Markov decision processes (MDPs) may involve three types of delays. First, state information, rather than being available instantaneously, may arrive with a delay (observation delay). Second, an action may take effect at a later decision stage rather than immediately (action delay). Third, the cost induced by an action may be collected after a number of stages (cost delay). We de rive two results, one for constant and one for random delays, for reducing an MDP with delays to an MDP without delays, which differs only in the size of the state space. The results are based on the intuition that costs may be collected asynchronously, i.e., at a stage other than the one in which they are induced, as long as they are discounted properly.
  • Keywords
    Markov processes; decision making; decision theory; delays; dynamic programming; neural nets; state estimation; Markov decision processes; action delay; asynchronous cost collection; constant delays; delays; neuro-dynamic programming; observation delay; random delays; state information; Cognition; Computer science; Control systems; Cost function; Decision making; Delay effects; Humans; Industrial engineering; Optimal control; State-space methods;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.2003.809799
  • Filename
    1193736