• DocumentCode
    637558
  • Title

    A sensitivity-based construction approach to sample-path variance minimization of Markov decision processes

  • Author

    Yonghao Huang ; Xi Chen

  • Author_Institution
    Dept. of Autom., Tsinghua Univ., Beijing, China
  • fYear
    2012
  • fDate
    15-16 Nov. 2012
  • Firstpage
    215
  • Lastpage
    220
  • Abstract
    We study the limiting average variance along the sample path as the secondary criterion for Markov decision processes, with the long-run average performance as the primary criterion. By applying the sensitivity-based approach, we intuitively construct the difference formula for the sample-path variance under different policies. Thereby, a sufficient condition for the sample-path variance optimality can be easily derived. This work extends the sensitivity-based construction approach to the Markov decision processes with the nonstandard performance criterion. Compared with the pure mathematical verification, the sensitivity-based construction approach shows more intuition and provides insights on the sample-path structure of Markov decision processes.
  • Keywords
    Markov processes; decision making; minimisation; Markov decision processes; limiting average variance; long-run average performance; nonstandard performance criterion; pure mathematical verification; sample-path variance minimization; sample-path variance optimality; sensitivity-based construction approach; Australia; Limiting; Markov processes; Minimization; Optimization; Poisson equations; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Conference (AUCC), 2012 2nd Australian
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-1-922107-63-3
  • Type

    conf

  • Filename
    6613199