Regional Information Center for Science and Technology - Continuous-time Markov decision processes with nth-bias optimality criteria

Title of article :

Continuous-time Markov decision processes with nth-bias optimality criteria

Author/Authors :

Zhang, Junyu and Cao, Xi-Ren

Issue Information :

Journal issue with serial number, year 2009

Pages :

From page :

1628

To page :

1638

Abstract :

In this paper, we study the nth-bias optimality problem for finite continuous-time Markov decision processes (MDPs) with a multichain structure. We first provide nth-bias difference formulas for two policies and present some interesting characterizations of an nth-bias optimal policy by using these difference formulas. Then, we prove the existence of an nth-bias optimal policy by using nth-bias optimal policy iteration algorithms, and show that such an nth-bias optimal policy can be obtained in a finite number of policy iterations.

Keywords :

Multichain model, Continuous-time systems, Policy iteration algorithms, Performance analysis, Sensitivity analysis, nth-bias optimality criteria, Markov decision processes

Journal title :

Automatica

Serial Year :

2009

Record number :

1447709

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1447709