مرکز منطقه ای اطلاع رساني علوم و فناوري - An exact iterative search algorithm for constrained Markov decision processes

Title of article :

An exact iterative search algorithm for constrained Markov decision processes

Author/Authors :

Chang، نويسنده , , Hyeong Soo، نويسنده ,

Issue Information :

روزنامه با شماره پیاپی سال 2014

Pages :

From page :

1531

To page :

1534

Abstract :

This communique provides an exact iterative search algorithm for the NP-hard problem of obtaining an optimal feasible stationary Markovian pure policy that achieves the maximum value averaged over an initial state distribution in finite constrained Markov decision processes. It is based on a novel characterization of the entire feasible policy space and takes the spirit of policy iteration (PI) in that a sequence of monotonically improving feasible policies is generated and converges to an optimal policy in iterations of the size of the policy space at the worst case. Unlike PI, an unconstrained MDP needs to be solved at iterations involved with feasible policies and the current best policy improves all feasible policies included in the union of the policy spaces associated with the unconstrained MDPs.

Keywords :

Markov decision processes , Constrained Optimization , Dynamic programming , Policy iteration

Journal title :

Automatica

Serial Year :

2014

Journal title :

Automatica

Record number :

1449864

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1449864