Title :
Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access
Author :
Liu, Keqin ; Zhao, Qing
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California at Davis, Davis, CA, USA
Abstract :
In this paper, we consider a class of restless multiarmed bandit processes (RMABs) that arises in dynamic multichannel access, user/server scheduling, and optimal activation in multiagent systems. For this class of RMABs, we establish indexability and obtain the Whittle index in closed form under both the discounted and average reward criteria. These results lead to a direct implementation of the Whittle index policy with remarkably low complexity. When arms are stochastically identical, we show that the Whittle index policy is optimal under certain conditions. Furthermore, it has a semiuniversal structure that obviates the need to know the Markov transition probabilities. Both the optimality and the semiuniversal structure follow from the equivalence between the Whittle index policy and the myopic policy established in this work. For nonidentical arms, we develop efficient algorithms for computing a performance upper bound given by Lagrangian relaxation. The tightness of the upper bound and the near-optimal performance of the Whittle index policy are illustrated with simulation examples.
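The myopic policy mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the channel model, so everything below is an assumption made for illustration: each channel is taken to be a two-state (bad = 0, good = 1) Markov chain with hypothetical transition probabilities p01 and p11, the user senses k channels per slot, and the reward is the number of sensed channels found good. The function names are hypothetical and not taken from the paper.

```python
def propagate_belief(omega, p01, p11):
    """Belief that an unobserved channel is good in the next slot.

    Assumed model: omega is the current probability the channel is good,
    p11 = P(good -> good), p01 = P(bad -> good).
    """
    return omega * p11 + (1.0 - omega) * p01


def myopic_step(beliefs, true_states, k, p01, p11):
    """One slot of a myopic policy under the assumed two-state model.

    Senses the k channels with the largest belief (maximizing the expected
    immediate reward), collects one unit of reward per good channel, and
    returns the updated beliefs for the next slot.
    """
    order = sorted(range(len(beliefs)), key=lambda i: beliefs[i], reverse=True)
    chosen = order[:k]
    chosen_set = set(chosen)
    reward = sum(true_states[i] for i in chosen)

    new_beliefs = []
    for i, omega in enumerate(beliefs):
        if i in chosen_set:
            # Observed channel: the state is known, so the belief resets.
            new_beliefs.append(p11 if true_states[i] == 1 else p01)
        else:
            # Unobserved channel: belief evolves by the Markov chain alone.
            new_beliefs.append(propagate_belief(omega, p01, p11))
    return chosen, reward, new_beliefs


# Hypothetical numbers, chosen only to exercise the functions.
chosen, reward, beliefs = myopic_step(
    beliefs=[0.2, 0.9, 0.5], true_states=[1, 0, 1], k=1, p01=0.2, p11=0.8
)
```

In this sketch the selection rule only compares beliefs, which is one way to see how a policy of this kind could run without evaluating an index formula; whether and when it coincides with the Whittle index policy is the subject of the paper's conditions, not something the sketch establishes.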
Keywords :
Markov processes; multi-access systems; scheduling; Lagrangian relaxation; Markov transition probability; average reward criteria; dynamic multichannel access; multiagent systems; restless multiarmed bandit indexability problems; semiuniversal structure; user-server scheduling; Whittle index optimality policy; Approximation methods; Channel models; Complexity theory; Indexes; Sensors; Upper bound; Dynamic channel selection; Whittle index; indexability; myopic policy; opportunistic access; restless multiarmed bandit (RMAB)
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2010.2068950