DocumentCode :
2322900
Title :
Non-Bayesian Learning of Channel Sensing Order for Dynamic Spectrum Access Networks
Author :
Li, Bowen ; Yang, Panlong ; Wang, Jinlong ; Wu, Qihui ; Xia, Nan
Author_Institution :
Inst. of Commun. Eng., Nanjing, China
fYear :
2011
fDate :
10-12 Oct. 2011
Firstpage :
529
Lastpage :
534
Abstract :
In this paper, we consider sequential channel sensing and accessing with unknown channel availability probabilities. In such sequential channel sensing framework, it is critical important to arrange the channel sensing order properly at the beginning of each time slot, so as to balance the tradeoff between finding an idle channel as soon as possible in current slot (exploitation by sensing currently looks better channel preferentially) and making the channel statistics more precise to benefit the following sensing order arrangement (exploration by sensing the channels currently looks suboptimal). To handle this tradeoff balancing problem, we propose five online learning algorithms: Pure Exploitation (PE), Exploitation with Optimism Initial Estimation (EOIE), UCB-based Order (UCBO), ε-greedy Order and SoftMax Order. The latter three algorithms are extended from three classic algorithms on handling multi-armed bandit problem: UCB1 [1], ε-greedy [2] and SoftMax [2]. Among these algorithms, UCBO is the only one that we have proved to be a zero-regret strategy, which means the system is guaranteed to achieve optimal sensing order by applying UCBO, if enough slots are played. We then evaluate the algorithms by simulations. The results are a little surprising but meaningful. We find that the PE algorithm, which is treated as a simple strategy with poor performance for traditional multi-armed bandit problems, performs very well in the order learning problem when the number of channels is small. And our proposed EOIE algorithm performs nearly optimal in all cases. In contrast, the UCB-based Order converges too slowly for actual application, although it provides theoretical zero-regret guarantee as we have proved.
Keywords :
cognitive radio; learning (artificial intelligence); probability; radio access networks; telecommunication computing; ε-greedy order algorithm; UCB-based order algorithm; channel availability probability; channel statistics; dynamic spectrum access network; exploitation with optimism initial estimation algorithm; multiarmed bandit problem; nonBayesian learning; pure exploitation algorithm; sequential channel access; sequential channel sensing; softmax order algorithm; Algorithm design and analysis; Availability; Channel estimation; Estimation; Indexes; Numerical models; Sensors; cognitive radio; online learning; sensing order;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1827-4
Type :
conf
DOI :
10.1109/CyberC.2011.91
Filename :
6079439
Link To Document :
بازگشت