Title :
A Marginal Productivity Index Policy for the Finite-Horizon Multiarmed Bandit Problem
Author_Institution :
Department of Statistics, Universidad Carlos III de Madrid, Av. Universidad 30, 28911 Leganés (Madrid), Spain. Email: jnimora@alum.mit.edu
Abstract :
This paper addresses the discounted discrete-state finite-horizon multiarmed bandit problem. The goal is to design a well-grounded and tractable scheduling policy of index type. The approach is based on reformulating the model as a restless bandit problem, and then deploying the marginal productivity index (MPI) theory developed by the author in recent work, which extends the approach of Klimov (1974), Gittins (1979) and Whittle (1988). It is shown that the model satisfies the author's sufficient conditions for the existence of the MPI. An efficient recursive procedure is proposed for computing the finite-horizon MPI. This further furnishes a new method for approximating the infinite-horizon Gittins index.
Keywords :
Current measurement; Dynamic scheduling; Electronic mail; Power generation economics; Productivity; Resource management; Statistics; Stochastic processes; Sufficient conditions; World Wide Web
Conference_Title :
Decision and Control, 2005 and 2005 European Control Conference. CDC-ECC '05. 44th IEEE Conference on
Conference_Location :
Seville, Spain
Print_ISBN :
0-7803-9567-0
DOI :
10.1109/CDC.2005.1582407