DocumentCode :
1240091
Title :
Percentile performance criteria for limiting average Markov decision processes
Author :
Filar, Jerzy A. ; Krass, Dmitry ; Ross, K.W. ; Ross, Keith W.
Author_Institution :
Dept. of Math. & Stat., Maryland Univ., Baltimore, MD, USA
Volume :
40
Issue :
1
fYear :
1995
fDate :
1/1/1995 12:00:00 AM
Firstpage :
2
Lastpage :
10
Abstract :
Addresses the following basic feasibility problem for infinite-horizon Markov decision processes (MDPs): can a policy be found that achieves a specified value (target) of the long-run limiting average reward at a specified probability level (percentile)? Related optimization problems of maximizing the target for a specified percentile and vice versa are also considered. The authors present a complete (and discrete) classification of both the maximal achievable target levels and of their corresponding percentiles. The authors also provide an algorithm for computing a deterministic policy corresponding to any feasible target-percentile pair. Next the authors consider similar problems for an MDP with multiple rewards and/or constraints. This case presents some difficulties and leads to several open problems. An LP-based formulation provides constructive solutions for most cases
Keywords :
Markov processes; decision theory; linear programming; probability; LP-based formulation; deterministic policy; feasibility problem; infinite-horizon Markov decision processes; limiting average Markov decision processes; long-run limiting average reward; maximal achievable target levels; multiple constraints; multiple rewards; optimization problems; percentile performance criteria; probability level; Mathematics; Operations research; Optimal control; Probability distribution; Random variables; Statistics; Testing;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/9.362904
Filename :
362904
Link To Document :
بازگشت