• DocumentCode
    1653443
  • Title

    Denumerable controlled Markov chains with average reward criterion: sample path optimality

  • Author

    Cavazos-Cadena, Rolando ; Fernandez-Gaucheraud, E.

  • Author_Institution
    Dept. de Estadistica y Calculo, Univ. Autonoma Agraria Antonio Narro, Saltillo, Mexico
  • Volume
    1
  • fYear
    1994
  • Firstpage
    162
  • Abstract
    There are numerous applications, in many different fields, of denumerable controlled Markov chain (CMC) models with an infinite planning horizon; see Bertsekas (1987), Ephremides and Verdu (1989), Ross (1983), Stidham and Weber (1993), and Tijms (1986). The authors consider the stochastic control problem of maximizing average rewards in the long-run, for denumerable CMCs. Departing from the most common position which uses expected values of rewards, the authors focus on a sample path analysis of the stream of states and actions. Under a Lyapunov function condition, the authors show that stationary policies obtained from the average reward optimality equation are not only expected average reward optimal, but indeed sample path average reward optimal
  • Keywords
    Markov processes; decision theory; stochastic systems; Lyapunov function condition; average reward criterion; denumerable controlled Markov chains; infinite planning horizon; sample path optimality; stationary policies; stochastic control problem; Equations; Extraterrestrial measurements; History; Lyapunov method; Operations research; Optimal control; State-space methods; Stochastic processes; Stochastic systems; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1994., Proceedings of the 33rd IEEE Conference on
  • Conference_Location
    Lake Buena Vista, FL
  • Print_ISBN
    0-7803-1968-0
  • Type

    conf

  • DOI
    10.1109/CDC.1994.411028
  • Filename
    411028