DocumentCode
2498249
Title
Bayesian active learning with basis functions
Author
Ryzhov, Ilya O. ; Powell, Warren B.
Author_Institution
Oper. Res. & Financial Eng., Princeton Univ., Princeton, NJ, USA
fYear
2011
fDate
11-15 April 2011
Firstpage
143
Lastpage
150
Abstract
A common technique for dealing with the curse of dimensionality in approximate dynamic programming is to use a parametric value function approximation, where the value of being in a state is assumed to be a linear combination of basis functions. Even with this simplification, we face the exploration/exploitation dilemma: an inaccurate approximation may lead to poor decisions, making it necessary to sometimes explore actions that appear to be suboptimal. We propose a Bayesian strategy for active learning with basis functions, based on the knowledge gradient concept from the optimal learning literature. The new method performs well in numerical experiments conducted on an energy storage problem.
Keywords
Bayes methods; dynamic programming; function approximation; learning (artificial intelligence); Bayesian active learning; basis functions; dynamic programming; energy storage problem; knowledge gradient concept; parametric value function approximation; Bayesian methods; Covariance matrix; Dynamic programming; Function approximation; Mathematical model; Tin;
fLanguage
English
Publisher
ieee
Conference_Titel
Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
Conference_Location
Paris
Print_ISBN
978-1-4244-9887-1
Type
conf
DOI
10.1109/ADPRL.2011.5967365
Filename
5967365
Link To Document