• Title of article

    Sample path optimality for a Markov optimization problem

  • Author/Authors

    Hunt، نويسنده , , F.Y.، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2005
  • Pages
    11
  • From page
    769
  • To page
    779
  • Abstract
    We study a unichain Markov decision process i.e. a controlled Markov process whose state process under a stationary policy is an ergodic Markov chain. Here the state and action spaces are assumed to be either finite or countable. When the state process is uniformly ergodic and the immediate cost is bounded then a policy that minimizes the long-term expected average cost also has an nth stage sample path cost that with probability one is asymptotically less than the nth stage sample path cost under any other non-optimal stationary policy with a larger expected average cost. This is a strengthening in the Markov model case of the a.s. asymptotically optimal property frequently discussed in the literature.
  • Keywords
    stochastic control , Azumaיs inequality , Markov decision process
  • Journal title
    Stochastic Processes and their Applications
  • Serial Year
    2005
  • Journal title
    Stochastic Processes and their Applications
  • Record number

    1577611