Title of article :
A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases
Author/Authors :
Xi-Ren Cao، نويسنده , , Xianping Guo، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2004
Keywords :
Perturbation analysis , Performance sensitivity , Policy iteration , Potentials , reinforcement learning
Journal title :
Automatica
Journal title :
Automatica