DocumentCode
2188189
Title
Controlled Markov chains with risk-sensitive average cost criterion: the non-irreducible case
Author
Brau-Rojas, Agustin ; Fernández-Gaucherand, Emmanuel
Author_Institution
Departamento de Matematicas, Sonora Univ., Mexico
Volume
3
fYear
2001
fDate
2001
Firstpage
2108
Abstract
We study discrete controlled Markov chains with finite state and action spaces. The performance of control policies is measured by a risk-sensitive average cost, the exponential average cost (EAC), which models risk-sensitivity by means of an exponential (dis)utility function. The main result is the characterization of the EAC corresponding to an arbitrary stationary deterministic policy in terms of the spectral radii of suitable irreducible matrices. This result generalizes a well known theorem of Howard and Matheson (1972) that deals with the particular case in which the transition probability matrix induced by the policy is primitive. It is shown that, when a stationary deterministic policy determines only one class of recurrent states, the corresponding EAC converges to the risk-null average cost as the risk-sensitivity coefficient goes to zero. However, it is also shown that for large risk-sensitivity, fundamental differences arise between both models. A proof of the existence of solutions to the associated optimality equation, under a simultaneous Doeblin condition and for small enough risk-sensitivity coefficient, is given. Our proof relies on the Perron-Frobenius theory of non-negative matrices. An example that shows the impact of risk-sensitivity on the Hernandez-Hernandez condition for the existence of solutions to an optimality inequality is constructed
Keywords
Markov processes; costing; matrix algebra; operations research; optimisation; probability; risk management; sensitivity analysis; Doeblin condition; Hernandez Hernandez condition; Markov chains; Perron-Frobenius theory; exponential average cost; optimisation; risk-sensitive average cost; risk-sensitivity coefficient; transition probability matrix; utility function; Computer aided software engineering; Computer science; Cost function; Equations; H infinity control; Linear matrix inequalities; Optimal control;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 2001. Proceedings of the 40th IEEE Conference on
Conference_Location
Orlando, FL
Print_ISBN
0-7803-7061-9
Type
conf
DOI
10.1109/.2001.980563
Filename
980563
Link To Document