DocumentCode :
962195
Title :
An Asymptotically Efficient Simulation-Based Algorithm for Finite Horizon Stochastic Dynamic Programming
Author :
Chang, Hyeong Soo ; Fu, Michael C. ; Hu, Jiaqiao ; Marcus, Steven I.
Author_Institution :
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul
Volume :
52
Issue :
1
fYear :
2007
Firstpage :
89
Lastpage :
94
Abstract :
We present a simulation-based algorithm called "Simulated Annealing Multiplicative Weights" (SAMW) for solving large finite-horizon stochastic dynamic programming problems. At each iteration of the algorithm, a probability distribution over candidate policies is updated by a simple multiplicative weight rule, and with proper annealing of a control parameter, the generated sequence of distributions converges to a distribution concentrated only on the best policies. The algorithm is "asymptotically efficient," in the sense that for the goal of estimating the value of an optimal policy, a provably convergent finite-time upper bound for the sample mean is obtained
Keywords :
dynamic programming; probability; simulated annealing; stochastic programming; Markov decision process; finite horizon stochastic dynamic programming; probability distribution; simulated annealing multiplicative weight; simulation-based algorithm; Computer science; Dynamic programming; Mathematics; Probability distribution; Random number generation; Simulated annealing; Statistics; Stochastic processes; Uncertainty; Upper bound; Learning algorithms; Markov decision processes; simulated annealing; simulation; stochastic dynamic programming;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.2006.887917
Filename :
4060977
Link To Document :
بازگشت