• DocumentCode
    3080962
  • Title

    Average and blackwell optimal policies in denumerable Markov decision chains

  • Author

    Hordijk, A.

  • Author_Institution
    University of Leiden
  • fYear
    1986
  • fDate
    10-12 Dec. 1986
  • Firstpage
    2121
  • Lastpage
    2123
  • Abstract
    This talk is based on the paper Dekker and Hordijk [1986]. In this paper we consider a (discrete-time) Markov decision chain with a denumerable state space and compact action sets and we assume that for all states the rewards and transition probabilities depend continuously on the actions. The first objective of this paper is to develop an analysis for average optimality without assuming a special Markov chain structure. In doing so, we present a set of conditions guaranteeing average optimality, which are automatically fulfilled in the finite state and action model. The second objective is to study simultaneously average and discount optimality as VEINOTT [1969] did for the finite state and action model. We investigate the concepts of n-discount and Blackwell optimality in the denumerable state space, using a Laurent series expansion for the discounted rewards. Under the same condition as for average optimality, we establish solutions to the n-discount optimality equations for every n.
  • Keywords
    Convergence; Equations; Optimal control; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1986 25th IEEE Conference on
  • Conference_Location
    Athens, Greece
  • Type

    conf

  • DOI
    10.1109/CDC.1986.267436
  • Filename
    4049178