• DocumentCode
    1300835
  • Title

    Mean-Variance Criteria for Finite Continuous-Time Markov Decision Processes

  • Author

    Guo, Xianping ; Song, Xinyuan

  • Author_Institution
    Sch. of Math. & Comput. Sci., Zhongshan Univ., Guangzhou, China
  • Volume
    54
  • Issue
    9
  • fYear
    2009
  • Firstpage
    2151
  • Lastpage
    2157
  • Abstract
    This technical note deals with the mean variance problem (known as the average variance (AV) minimization problem) for finite continuous time Markov decision processes. We first introduce a so called G-condition which is weaker than the well known ergodicity and unichain conditions and sufficient for the finiteness of the AV of a policy. Also, we present an example of a policy having infinite AV when the G-condition is not satisfied. Under the G-condition we prove that the AV criterion can be transformed into an equivalent mean (or expected) average criterion by using a martingale technique and an observation from the canonical form of a transition rate matrix, and thus the existence and calculation of an AV minimal policy over a class of mean optimal policies are obtained by a policy iteration algorithm in an finite number of iterations. As byproduct, we obtain some interesting new results about the mean average optimality.
  • Keywords
    Markov processes; continuous time systems; iterative methods; G-condition; average variance minimization problem; canonical form; equivalent mean average criterion; ergodicity condition; finite continuous time Markov decision process; martingale technique; mean variance criteria; policy iteration algorithm; transition rate matrix; unichain condition; Councils; Mathematics; Minimization methods; Portfolios; Statistics; Terminology; AMS (2000): 90C40, 93E20; G-condition; finite continuous-time Markov decision process (CTMDP); mean-variance; policy iteration algorithm; variance minimization policy;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.2009.2023833
  • Filename
    5208190