Title :
Approximate dynamic programming using fluid and diffusion approximations with applications to power management
Author :
Chen, Wei ; Huang, Dayu ; Kulkarni, Ankur A. ; Unnikrishnan, Jayakrishnan ; Zhu, Quanyan ; Mehta, Prashant ; Meyn, Sean ; Wierman, Adam
Author_Institution :
Coordinated Sci. Lab., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
TD learning and its refinements are powerful tools for approximating the solution to dynamic programming problems. However, the techniques provide the approximate solution only within a prescribed finite-dimensional function class. Thus, the question that always arises is how should the function class be chosen? The goal of this paper is to propose an approach for TD learning based on choosing the function class using the solutions to associated fluid and diffusion approximations. In order to illustrate this new approach, the paper focuses on an application to dynamic speed scaling for power management.
Keywords :
approximation theory; dynamic programming; learning systems; multidimensional systems; TD learning; approximate dynamic programming; diffusion approximation; dynamic programming problems; dynamic speed scaling; finite-dimensional function class; fluid approximation; power management; Communication system control; Costs; Delay; Dynamic programming; Energy management; Equations; Fluid dynamics; Power system modeling; State-space methods; Stochastic processes;
Conference_Titel :
Decision and Control, 2009 held jointly with the 2009 28th Chinese Control Conference. CDC/CCC 2009. Proceedings of the 48th IEEE Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3871-6
Electronic_ISBN :
0191-2216
DOI :
10.1109/CDC.2009.5399685