Title of article

Discounted Markov decision processes with utility constraints

Author/Authors

Yoshinobu Kadota، نويسنده , , Masami Kurano، نويسنده , , Masami Yasuda، نويسنده ,

Issue Information

دوهفته نامه با شماره پیاپی سال 2006

Pages

From page

279

To page

284

Abstract

We consider utility-constrained Markov decision processes. The expected utility of the total discounted reward is maximized subject to multiple expected utility constraints. By introducing a corresponding Lagrange function, a saddle-point theorem of the utility constrained optimization is derived. The existence of a constrained optimal policy is characterized by optimal action sets specified with a parametric utility.

Keywords

Constrained optimal policy , Saddle-point , Markov decision processes , Lagrange technique , Utility constraints , Discount criterion

Journal title

Computers and Mathematics with Applications

Serial Year

2006

Journal title

Computers and Mathematics with Applications

Record number

919748

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=919748