• DocumentCode
    5542
  • Title

    First Passage Optimality for Continuous-Time Markov Decision Processes With Varying Discount Factors and History-Dependent Policies

  • Author

    Xianping Guo ; Xinyuan Song ; Yi Zhang

  • Author_Institution
    Sch. of Math. & Comput. Sci., Sun Yat-Sen Univ., Guangzhou, China
  • Volume
    59
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    163
  • Lastpage
    174
  • Abstract
    This paper is an attempt to study the first passage optimality criterion for continuous-time Markov decision processes with state-dependent discount factors and history-dependent policies. The state space is denumerable, the action space is a Borel space, and the transition and reward rates are unbounded. Under suitable conditions, we show the existence of a deterministic stationary optimal policy, establish the Bellman (optimality) equation, to which the value function is the unique solution, and give the value and policy iteration algorithms for solving (at least approximating) the value function and an optimal policy. Furthermore, we give examples about reliability and controlled birth processes with killing to illustrate the potential applications of the results obtained here, and also to show the difference between the main results in this paper and those in the previous literature.
  • Keywords
    Markov processes; decision making; decision theory; iterative methods; Bellman equation; Borel space; action space; continuous-time Markov decision processes; controlled birth processes; deterministic stationary optimal policy; first passage optimality criterion; history-dependent policies; policy iteration algorithms; reward rates; state-dependent discount factors; transition rates; value iteration algorithms; Educational institutions; Equations; Kernel; Markov processes; Optimization; Process control; Standards; Continuous-time Markov decision process; first passage criterion; varying discount factor;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.2013.2281475
  • Filename
    6595596