Title of article :
On the adaptive control of a class of partially observed Markov decision processes
Author/Authors :
Hsu، نويسنده , , Shun-Pin and Arapostathis، نويسنده , , Ari، نويسنده ,
Issue Information :
دوهفته نامه با شماره پیاپی سال 2011
Pages :
9
From page :
1
To page :
9
Abstract :
This paper is concerned with the adaptive control problem, over the infinite horizon, for partially observable Markov decision processes whose transition functions are parameterized by an unknown vector. We treat finite models and impose relatively mild assumptions on the transition function. Provided that a sequence of parameter estimates converging in probability to the true parameter value is available, we show that the certainty equivalence adaptive policy is optimal in the long-run average sense.
Keywords :
Average-cost optimality , Markov decision processes , Adaptive control
Journal title :
Journal of Mathematical Analysis and Applications
Serial Year :
2011
Journal title :
Journal of Mathematical Analysis and Applications
Record number :
1561850
Link To Document :
بازگشت