DocumentCode :
2188216
Title :
Markov decision processes with risk-sensitive criteria: dynamic programming operators and discounted stochastic games
Author :
Cavazos-Cadena, Rolando ; Fernandez-Gaucherand, E.
Author_Institution :
Departamento de Estadistica y Calculo, Univ. Autonoma Agraria Antonio Narro, Saltillo, Mexico
Volume :
3
fYear :
2001
fDate :
2001
Firstpage :
2110
Abstract :
We study discrete-time Markov decision processes with denumerable state space and bounded costs per stage. It is assumed that the decision maker exhibits a constant sensitivity to risk, and that the performance of a control policy is measured by a (long-run) risk-sensitive average cost criterion. Besides standard continuity-compactness conditions, the basic structural constraint on the decision model is that the transition law satisfies a simultaneous Doeblin condition. Within this framework, the main objective is to study the existence of bounded solutions to the risk-sensitive average cost optimality equation. Our main result guarantees a bounded solution to the optimality equation only if the risk sensitivity coefficient λ is small enough and, via a detailed example, it can be shown that such a conclusion cannot be extended to arbitrary values of λ. Our results are in opposition to previous claims in the literature, but agree with recent results obtained via a direct probabilistic analysis. A key analysis tool developed in the paper is the definition of an appropriate operator with contractive properties, analogous to the dynamic programming operator in Bellman´s equation, and a family of (value) functions with a discounted stochastic games interpretation
Keywords :
Markov processes; decision theory; dynamic programming; probability; state-space methods; stochastic games; stochastic systems; Bellman equation; Doeblin condition; Markov decision processes; bounded costs; decision model; discounted stochastic games; dynamic programming; optimality equation; probability; risk-sensitive average cost criterion; state space; stochastic dynamical systems; Computer science; Cost function; Difference equations; Dynamic programming; Performance analysis; Risk analysis; Space stations; State-space methods; Stochastic processes; Stochastic systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 2001. Proceedings of the 40th IEEE Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-7061-9
Type :
conf
DOI :
10.1109/.2001.980564
Filename :
980564
Link To Document :
بازگشت