DocumentCode
3312160
Title
Asymptotic properties of Markov decision processes
Author
Brockett, Roger
Author_Institution
Sch. of Eng. & Appl. Sci., Harvard Univ., Cambridge, MA, USA
fYear
2009
fDate
15-18 Dec. 2009
Firstpage
3587
Lastpage
3591
Abstract
In our earlier paper we derived the optimal control policy for a time varying, finite horizon, continuous time Markov processes subject to a quadratic penalty on the amount of the rate adjustment. Some aspects of the time invariant, infinite horizon problem were resolved but questions remained. In this paper we consider a more general class of performance measures and constraints on the controls. After deriving the appropriate Hamilton-Jacobi equation, we discuss asymptotic properties. The central question here relates to the possibility of that non constant (e.g., periodic) steady state policies may have better average performance than the best constant policy.
Keywords
Jacobian matrices; Markov processes; optimal control; Hamilton-Jacobi equation; continuous time Markov decision processes; infinite horizon problem; optimal control policy; quadratic penalty; rate adjustment; Bismuth; Cost function; Counting circuits; Differential equations; Feedback control; Infinite horizon; Markov processes; Optimal control; State-space methods; Steady-state;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 2009 held jointly with the 2009 28th Chinese Control Conference. CDC/CCC 2009. Proceedings of the 48th IEEE Conference on
Conference_Location
Shanghai
ISSN
0191-2216
Print_ISBN
978-1-4244-3871-6
Electronic_ISBN
0191-2216
Type
conf
DOI
10.1109/CDC.2009.5400564
Filename
5400564
Link To Document