• DocumentCode
    1743874
  • Title

    Markov decision processes with constrained stopping times

  • Author

    Woriguchi, M. ; Kurano, M. ; Yasuda, M.

  • Author_Institution
    Div. of Math. Sci. & Phys., Chiba Univ., Japan
  • Volume
    1
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    706
  • Abstract
    The optimization problem for a stopped Markov decision process is considered to be taken over stopping times τ constrained so that Eτ⩽α for some fixed α>0. We introduce the concept of a randomized stationary stopping time which is a mixed extension of the entry time of a stopping region and prove the existence of an optimal constrained pair of stationary policy and stopping time by utilizing a Lagrange multiplier approach. Also, applying the idea of the one-step look ahead (OLA) policy the optimal constrained pair is sought concretely. As an example, constrained Markov deteriorating system is explained
  • Keywords
    Markov processes; decision theory; optimisation; Lagrange multiplier approach; Markov decision processes; OLA policy; constrained Markov deteriorating system; constrained stopping times; entry time mixed extension; one-step look ahead policy; optimal constrained pair; randomized stationary stopping time; Constraint optimization; Convergence; Dynamic programming; Equations; Lagrangian functions; Probability distribution; State-space methods; Stochastic processes; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2000. Proceedings of the 39th IEEE Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    0191-2216
  • Print_ISBN
    0-7803-6638-7
  • Type

    conf

  • DOI
    10.1109/CDC.2000.912850
  • Filename
    912850