• DocumentCode
    2582953
  • Title

    Risk-constrained Markov decision processes

  • Author

    Borkar, Vivek ; Jain, Rahul

  • Author_Institution
    Sch. of Technol. & Comput. Sci., Tata Inst. of Fundamental Res. (TIFR) Mumbai, Mumbai, India
  • fYear
    2010
  • fDate
    15-17 Dec. 2010
  • Firstpage
    2664
  • Lastpage
    2669
  • Abstract
    We propose a new constrained Markov decision process framework with risk-type constraints. The risk metric we use is Conditional Value-at-Risk (CVaR), which is gaining popularity in finance. It is a conditional expectation but the conditioning is defined in terms of the level of the tail probability. We propose an iterative offline algorithm to find the risk-contrained optimal control policy. A stochastic approximation-inspired `learning´ variant is also sketched.
  • Keywords
    Markov processes; approximation theory; iterative methods; optimal control; stochastic programming; Markov decision process; conditional value-at-risk; iterative offline algorithm; optimal control; risk-type constraints; stochastic approximation-inspired learning variant; tail probability; Approximation algorithms; Approximation methods; Convergence; Markov processes; Optimization; Yttrium; Constrained Markov decision processes; Risk measures; Stochastic Approximations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control (CDC), 2010 49th IEEE Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    0743-1546
  • Print_ISBN
    978-1-4244-7745-6
  • Type

    conf

  • DOI
    10.1109/CDC.2010.5718076
  • Filename
    5718076