DocumentCode
1300835
Title
Mean-Variance Criteria for Finite Continuous-Time Markov Decision Processes
Author
Guo, Xianping ; Song, Xinyuan
Author_Institution
Sch. of Math. & Comput. Sci., Zhongshan Univ., Guangzhou, China
Volume
54
Issue
9
fYear
2009
Firstpage
2151
Lastpage
2157
Abstract
This technical note deals with the mean variance problem (known as the average variance (AV) minimization problem) for finite continuous time Markov decision processes. We first introduce a so called G-condition which is weaker than the well known ergodicity and unichain conditions and sufficient for the finiteness of the AV of a policy. Also, we present an example of a policy having infinite AV when the G-condition is not satisfied. Under the G-condition we prove that the AV criterion can be transformed into an equivalent mean (or expected) average criterion by using a martingale technique and an observation from the canonical form of a transition rate matrix, and thus the existence and calculation of an AV minimal policy over a class of mean optimal policies are obtained by a policy iteration algorithm in an finite number of iterations. As byproduct, we obtain some interesting new results about the mean average optimality.
Keywords
Markov processes; continuous time systems; iterative methods; G-condition; average variance minimization problem; canonical form; equivalent mean average criterion; ergodicity condition; finite continuous time Markov decision process; martingale technique; mean variance criteria; policy iteration algorithm; transition rate matrix; unichain condition; Councils; Mathematics; Minimization methods; Portfolios; Statistics; Terminology; AMS (2000): 90C40, 93E20; G-condition; finite continuous-time Markov decision process (CTMDP); mean-variance; policy iteration algorithm; variance minimization policy;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/TAC.2009.2023833
Filename
5208190
Link To Document