Mean-Variance Criteria for Finite Continuous-Time Markov Decision Processes

Author

Guo, Xianping ; Song, Xinyuan

Author_Institution

Sch. of Math. & Comput. Sci., Zhongshan Univ., Guangzhou, China

Volume

54

Issue

9

fYear

2009

Firstpage

2151

Lastpage

2157

Abstract

This technical note deals with the mean variance problem (known as the average variance (AV) minimization problem) for finite continuous time Markov decision processes. We first introduce a so called G-condition which is weaker than the well known ergodicity and unichain conditions and sufficient for the finiteness of the AV of a policy. Also, we present an example of a policy having infinite AV when the G-condition is not satisfied. Under the G-condition we prove that the AV criterion can be transformed into an equivalent mean (or expected) average criterion by using a martingale technique and an observation from the canonical form of a transition rate matrix, and thus the existence and calculation of an AV minimal policy over a class of mean optimal policies are obtained by a policy iteration algorithm in an finite number of iterations. As byproduct, we obtain some interesting new results about the mean average optimality.

Keywords

Markov processes; continuous time systems; iterative methods; G-condition; average variance minimization problem; canonical form; equivalent mean average criterion; ergodicity condition; finite continuous time Markov decision process; martingale technique; mean variance criteria; policy iteration algorithm; transition rate matrix; unichain condition; Councils; Mathematics; Minimization methods; Portfolios; Statistics; Terminology; AMS (2000): 90C40, 93E20; G-condition; finite continuous-time Markov decision process (CTMDP); mean-variance; policy iteration algorithm; variance minimization policy;

fLanguage

English

Journal_Title

Automatic Control, IEEE Transactions on

Publisher

ieee

ISSN

0018-9286

Type

jour

DOI

10.1109/TAC.2009.2023833

Filename

5208190