Title of article :
Continuous-time Markov decision processes with nth-bias optimality criteria
Author/Authors :
Zhang, Junyu and Cao, Xi-Ren
Issue Information :
Journal with serial number, year 2009
Pages :
11
From page :
1628
To page :
1638
Abstract :
In this paper, we study the nth-bias optimality problem for finite continuous-time Markov decision processes (MDPs) with a multichain structure. We first provide nth-bias difference formulas for two policies and present some interesting characterizations of an nth-bias optimal policy by using these difference formulas. Then, we prove the existence of an nth-bias optimal policy by using nth-bias optimal policy iteration algorithms, and show that such an nth-bias optimal policy can be obtained in a finite number of policy iterations.
Keywords :
Multichain model, Continuous-time systems, Policy iteration algorithms, Performance analysis, Sensitivity analysis, nth-bias optimality criteria, Markov decision processes
Journal title :
Automatica
Serial Year :
2009
Record number :
1447709