DocumentCode
637558
Title
A sensitivity-based construction approach to sample-path variance minimization of Markov decision processes
Author
Yonghao Huang ; Xi Chen
Author_Institution
Dept. of Autom., Tsinghua Univ., Beijing, China
fYear
2012
fDate
15-16 Nov. 2012
Firstpage
215
Lastpage
220
Abstract
We study the limiting average variance along the sample path as the secondary criterion for Markov decision processes, with the long-run average performance as the primary criterion. By applying the sensitivity-based approach, we intuitively construct the difference formula for the sample-path variance under different policies. Thereby, a sufficient condition for the sample-path variance optimality can be easily derived. This work extends the sensitivity-based construction approach to the Markov decision processes with the nonstandard performance criterion. Compared with the pure mathematical verification, the sensitivity-based construction approach shows more intuition and provides insights on the sample-path structure of Markov decision processes.
Keywords
Markov processes; decision making; minimisation; Markov decision processes; limiting average variance; long-run average performance; nonstandard performance criterion; pure mathematical verification; sample-path variance minimization; sample-path variance optimality; sensitivity-based construction approach; Australia; Limiting; Markov processes; Minimization; Optimization; Poisson equations; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Conference (AUCC), 2012 2nd Australian
Conference_Location
Sydney, NSW
Print_ISBN
978-1-922107-63-3
Type
conf
Filename
6613199
Link To Document