• DocumentCode
    2188216
  • Title

    Markov decision processes with risk-sensitive criteria: dynamic programming operators and discounted stochastic games

  • Author

    Cavazos-Cadena, Rolando ; Fernandez-Gaucherand, E.

  • Author_Institution
    Departamento de Estadistica y Calculo, Univ. Autonoma Agraria Antonio Narro, Saltillo, Mexico
  • Volume
    3
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    2110
  • Abstract
    We study discrete-time Markov decision processes with denumerable state space and bounded costs per stage. It is assumed that the decision maker exhibits a constant sensitivity to risk, and that the performance of a control policy is measured by a (long-run) risk-sensitive average cost criterion. Besides standard continuity-compactness conditions, the basic structural constraint on the decision model is that the transition law satisfies a simultaneous Doeblin condition. Within this framework, the main objective is to study the existence of bounded solutions to the risk-sensitive average cost optimality equation. Our main result guarantees a bounded solution to the optimality equation only if the risk sensitivity coefficient λ is small enough and, via a detailed example, it can be shown that such a conclusion cannot be extended to arbitrary values of λ. Our results are in opposition to previous claims in the literature, but agree with recent results obtained via a direct probabilistic analysis. A key analysis tool developed in the paper is the definition of an appropriate operator with contractive properties, analogous to the dynamic programming operator in Bellman´s equation, and a family of (value) functions with a discounted stochastic games interpretation
  • Keywords
    Markov processes; decision theory; dynamic programming; probability; state-space methods; stochastic games; stochastic systems; Bellman equation; Doeblin condition; Markov decision processes; bounded costs; decision model; discounted stochastic games; dynamic programming; optimality equation; probability; risk-sensitive average cost criterion; state space; stochastic dynamical systems; Computer science; Cost function; Difference equations; Dynamic programming; Performance analysis; Risk analysis; Space stations; State-space methods; Stochastic processes; Stochastic systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2001. Proceedings of the 40th IEEE Conference on
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    0-7803-7061-9
  • Type

    conf

  • DOI
    10.1109/.2001.980564
  • Filename
    980564