• DocumentCode
    3312160
  • Title

    Asymptotic properties of Markov decision processes

  • Author

    Brockett, Roger

  • Author_Institution
    Sch. of Eng. & Appl. Sci., Harvard Univ., Cambridge, MA, USA
  • fYear
    2009
  • fDate
    15-18 Dec. 2009
  • Firstpage
    3587
  • Lastpage
    3591
  • Abstract
    In our earlier paper we derived the optimal control policy for a time varying, finite horizon, continuous time Markov processes subject to a quadratic penalty on the amount of the rate adjustment. Some aspects of the time invariant, infinite horizon problem were resolved but questions remained. In this paper we consider a more general class of performance measures and constraints on the controls. After deriving the appropriate Hamilton-Jacobi equation, we discuss asymptotic properties. The central question here relates to the possibility of that non constant (e.g., periodic) steady state policies may have better average performance than the best constant policy.
  • Keywords
    Jacobian matrices; Markov processes; optimal control; Hamilton-Jacobi equation; continuous time Markov decision processes; infinite horizon problem; optimal control policy; quadratic penalty; rate adjustment; Bismuth; Cost function; Counting circuits; Differential equations; Feedback control; Infinite horizon; Markov processes; Optimal control; State-space methods; Steady-state;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2009 held jointly with the 2009 28th Chinese Control Conference. CDC/CCC 2009. Proceedings of the 48th IEEE Conference on
  • Conference_Location
    Shanghai
  • ISSN
    0191-2216
  • Print_ISBN
    978-1-4244-3871-6
  • Electronic_ISBN
    0191-2216
  • Type

    conf

  • DOI
    10.1109/CDC.2009.5400564
  • Filename
    5400564