Title of article
Occupation measures in average cost Markov decision processes
Author/Authors
Hosaka، Masanori نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2000
Pages
-96
From page
97
To page
0
Abstract
We consider the average cost Markov decision processes (MDPʹs) with general state and action spaces. Extending the idea in Borkarʹs excellent paper [3, 4], we define an extended occupation measure associated with the class of policies for MDPʹs and an annexed index (called a power), by which the validity for optimization is measured. Also, by construction of an extended occupation measure, the policy with robustness for the cost function is given. The proofs are done without continuity and compactness and universally and/or analytically measurable policies are unnecessary to describe the results, which are new in this paper.
Journal title
Journal of Information and Optimization Sciences
Serial Year
2000
Journal title
Journal of Information and Optimization Sciences
Record number
38065
Link To Document