Title :
Denumerable controlled Markov chains with average reward criterion: sample path optimality
Author :
Cavazos-Cadena, Rolando ; Fernandez-Gaucheraud, E.
Author_Institution :
Dept. de Estadistica y Calculo, Univ. Autonoma Agraria Antonio Narro, Saltillo, Mexico
Abstract :
There are numerous applications, in many different fields, of denumerable controlled Markov chain (CMC) models with an infinite planning horizon; see Bertsekas (1987), Ephremides and Verdu (1989), Ross (1983), Stidham and Weber (1993), and Tijms (1986). The authors consider the stochastic control problem of maximizing average rewards in the long-run, for denumerable CMCs. Departing from the most common position which uses expected values of rewards, the authors focus on a sample path analysis of the stream of states and actions. Under a Lyapunov function condition, the authors show that stationary policies obtained from the average reward optimality equation are not only expected average reward optimal, but indeed sample path average reward optimal
Keywords :
Markov processes; decision theory; stochastic systems; Lyapunov function condition; average reward criterion; denumerable controlled Markov chains; infinite planning horizon; sample path optimality; stationary policies; stochastic control problem; Equations; Extraterrestrial measurements; History; Lyapunov method; Operations research; Optimal control; State-space methods; Stochastic processes; Stochastic systems; Topology;
Conference_Titel :
Decision and Control, 1994., Proceedings of the 33rd IEEE Conference on
Conference_Location :
Lake Buena Vista, FL
Print_ISBN :
0-7803-1968-0
DOI :
10.1109/CDC.1994.411028