DocumentCode
295134
Title
Weighted Markov decision processes with perturbation
Author
Liu, Ke ; Filar, Jerzy A.
Author_Institution
Sch. of Math., Univ. of South Australia, Adelaide, SA, Australia
Volume
3
fYear
1995
fDate
13-15 Dec 1995
Firstpage
2269
Abstract
Considers the weighted reward MDPs with perturbation. The authors give the proof of existence of a δ-optimal simple ultimately deterministic policy under the assumption of “scalar value”. The authors also prove that there exists a δ-i-optimal simple ultimately deterministic policy in the perturbed weighted MDP, for all ε ∈[0,ε´] even without the assumption of “scalar value”
Keywords
Markov processes; decision theory; δ-i-optimal simple ultimately deterministic policy; δ-optimal simple ultimately deterministic policy; weighted Markov decision processes; Australia; History; Mathematics; Radio frequency; Random variables; State-space methods; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
Conference_Location
New Orleans, LA
ISSN
0191-2216
Print_ISBN
0-7803-2685-7
Type
conf
DOI
10.1109/CDC.1995.480541
Filename
480541
Link To Document