Title :
Approximately optimal adaptive learning in opportunistic spectrum access
Author :
Tekin, Cem ; Liu, Mingyan
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of Michigan, Ann Arbor, MI, USA
Abstract :
In this paper we develop an adaptive learning algorithm which is approximately optimal for an opportunistic spectrum access (OSA) problem with polynomial complexity. In this OSA problem each channel is modeled as a two state discrete time Markov chain with a bad state which yields no reward and a good state which yields reward. This is known as the Gilbert-Elliot channel model and represents variations in the channel condition due to fading, primary user activity, etc. There is a user who can transmit on one channel at a time, and whose goal is to maximize its throughput. Without knowing the transition probabilities and only observing the state of the channel currently selected, the user faces a partially observed Markov decision problem (POMDP) with unknown transition structure. In general, learning the optimal policy in this setting is intractable. We propose a computationally efficient learning algorithm which is approximately optimal for the infinite horizon average reward criterion.
Keywords :
Markov processes; channel allocation; fading channels; learning (artificial intelligence); telecommunication computing; Gilbert-Elliot channel model; adaptive learning algorithm; approximately optimal adaptive learning; discrete time Markov chain; fading channel; opportunistic spectrum access; partially observed Markov decision problem; polynomial complexity; primary user activity; Approximation algorithms; Complexity theory; Indexes; Markov processes; Optimized production technology; Polynomials; Probability; Approximate optimality; online learning; opportunistic spectrum access; restless bandits;
Conference_Titel :
INFOCOM, 2012 Proceedings IEEE
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4673-0773-4
DOI :
10.1109/INFCOM.2012.6195522