Title of article :
Discounted Markov decision processes with utility constraints
Author/Authors :
Yoshinobu Kadota, Masami Kurano, Masami Yasuda
Issue Information :
Biweekly, serial issue number, year 2006
Abstract :
We consider utility-constrained Markov decision processes, in which the expected utility of the total discounted reward is maximized subject to multiple expected-utility constraints. By introducing a corresponding Lagrange function, a saddle-point theorem for the utility-constrained optimization problem is derived. The existence of a constrained optimal policy is characterized by optimal action sets specified with a parametric utility.
Keywords :
Constrained optimal policy , Saddle-point , Markov decision processes , Lagrange technique , Utility constraints , Discount criterion
Journal title :
Computers and Mathematics with Applications