DocumentCode
66656
Title
A New Optimal Stepsize for Approximate Dynamic Programming
Author
Ryzhov, Ilya O. ; Frazier, Peter I. ; Powell, Warren B.
Author_Institution
Robert H. Smith Sch. of Bus., Univ. of Maryland, College Park, MD, USA
Volume
60
Issue
3
fYear
2015
fDate
Mar-15
Firstpage
743
Lastpage
758
Abstract
Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems. The design of effective ADP algorithms has many dimensions, but one crucial factor is the stepsize rule used to update a value function approximation. Many operations research applications are computationally intensive, and it is important to obtain good results quickly. Furthermore, the most popular stepsize formulas use tunable parameters and can produce very poor results if tuned improperly. We derive a new stepsize rule that optimizes the prediction error in order to improve the short-term performance of an ADP algorithm. With only one, relatively insensitive tunable parameter, the new rule adapts to the level of noise in the problem and produces faster convergence in numerical experiments.
Keywords
dynamic programming; function approximation; ADP algorithms; approximate dynamic programming; energy systems; health care; insensitive tunable parameter; large-scale transportation problems; operations research applications; optimal stepsize; revenue management; stepsize rule; tunable parameters; value function approximation; Approximation algorithms; Convergence; Dynamic programming; Function approximation; Operations research; Tin; Approximate dynamic programming (ADP); Kalman filter; simulation-based optimization; stochastic approximation;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/TAC.2014.2357134
Filename
6897935
Link To Document