• DocumentCode
    2717164
  • Title

    An Optimal ADP Algorithm for a High-Dimensional Stochastic Control Problem

  • Author

    Nascimento, Juliana ; Powell, Warren

  • Author_Institution
    Dept. of Operations Res. & Financial Eng., Princeton Univ., NJ
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    52
  • Lastpage
    59
  • Abstract
    We propose a provably optimal approximate dynamic programming algorithm for a class of multistage stochastic problems, taking into account that the probability distribution of the underlying stochastic process is not known and the state space is too large to be explored entirely. The algorithm and its proof of convergence rely on the fact that the optimal value functions of the problems within the problem class are concave and piecewise linear. The algorithm is a combination of Monte Carlo simulation, pure exploitation, stochastic approximation and a projection operation. Several applications, in areas like energy, control, inventory and finance, fall under the framework
  • Keywords
    Monte Carlo methods; dynamic programming; probability; stochastic processes; Monte Carlo simulation; optimal approximate dynamic programming; optimal value functions; probability distribution; projection operation; pure exploitation; stochastic approximation; stochastic control; Approximation algorithms; Convergence; Dynamic programming; Heuristic algorithms; Optimal control; Piecewise linear approximation; Piecewise linear techniques; Probability distribution; State-space methods; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0706-0
  • Type

    conf

  • DOI
    10.1109/ADPRL.2007.368169
  • Filename
    4220814