DocumentCode
1653443
Title
Denumerable controlled Markov chains with average reward criterion: sample path optimality
Author
Cavazos-Cadena, Rolando ; Fernandez-Gaucheraud, E.
Author_Institution
Dept. de Estadistica y Calculo, Univ. Autonoma Agraria Antonio Narro, Saltillo, Mexico
Volume
1
fYear
1994
Firstpage
162
Abstract
There are numerous applications, in many different fields, of denumerable controlled Markov chain (CMC) models with an infinite planning horizon; see Bertsekas (1987), Ephremides and Verdu (1989), Ross (1983), Stidham and Weber (1993), and Tijms (1986). The authors consider the stochastic control problem of maximizing average rewards in the long-run, for denumerable CMCs. Departing from the most common position which uses expected values of rewards, the authors focus on a sample path analysis of the stream of states and actions. Under a Lyapunov function condition, the authors show that stationary policies obtained from the average reward optimality equation are not only expected average reward optimal, but indeed sample path average reward optimal
Keywords
Markov processes; decision theory; stochastic systems; Lyapunov function condition; average reward criterion; denumerable controlled Markov chains; infinite planning horizon; sample path optimality; stationary policies; stochastic control problem; Equations; Extraterrestrial measurements; History; Lyapunov method; Operations research; Optimal control; State-space methods; Stochastic processes; Stochastic systems; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 1994., Proceedings of the 33rd IEEE Conference on
Conference_Location
Lake Buena Vista, FL
Print_ISBN
0-7803-1968-0
Type
conf
DOI
10.1109/CDC.1994.411028
Filename
411028
Link To Document