مرکز منطقه ای اطلاع رساني علوم و فناوري - Non-Bayesian Learning of Channel Sensing Order for Dynamic Spectrum Access Networks

DocumentCode :

2322900

Title :

Non-Bayesian Learning of Channel Sensing Order for Dynamic Spectrum Access Networks

Author :

Li, Bowen ; Yang, Panlong ; Wang, Jinlong ; Wu, Qihui ; Xia, Nan

Author_Institution :

Inst. of Commun. Eng., Nanjing, China

fYear :

2011

fDate :

10-12 Oct. 2011

Firstpage :

529

Lastpage :

534

Abstract :

In this paper, we consider sequential channel sensing and accessing with unknown channel availability probabilities. In such sequential channel sensing framework, it is critical important to arrange the channel sensing order properly at the beginning of each time slot, so as to balance the tradeoff between finding an idle channel as soon as possible in current slot (exploitation by sensing currently looks better channel preferentially) and making the channel statistics more precise to benefit the following sensing order arrangement (exploration by sensing the channels currently looks suboptimal). To handle this tradeoff balancing problem, we propose five online learning algorithms: Pure Exploitation (PE), Exploitation with Optimism Initial Estimation (EOIE), UCB-based Order (UCBO), ε-greedy Order and SoftMax Order. The latter three algorithms are extended from three classic algorithms on handling multi-armed bandit problem: UCB1 [1], ε-greedy [2] and SoftMax [2]. Among these algorithms, UCBO is the only one that we have proved to be a zero-regret strategy, which means the system is guaranteed to achieve optimal sensing order by applying UCBO, if enough slots are played. We then evaluate the algorithms by simulations. The results are a little surprising but meaningful. We find that the PE algorithm, which is treated as a simple strategy with poor performance for traditional multi-armed bandit problems, performs very well in the order learning problem when the number of channels is small. And our proposed EOIE algorithm performs nearly optimal in all cases. In contrast, the UCB-based Order converges too slowly for actual application, although it provides theoretical zero-regret guarantee as we have proved.

Keywords :

cognitive radio; learning (artificial intelligence); probability; radio access networks; telecommunication computing; ε-greedy order algorithm; UCB-based order algorithm; channel availability probability; channel statistics; dynamic spectrum access network; exploitation with optimism initial estimation algorithm; multiarmed bandit problem; nonBayesian learning; pure exploitation algorithm; sequential channel access; sequential channel sensing; softmax order algorithm; Algorithm design and analysis; Availability; Channel estimation; Estimation; Indexes; Numerical models; Sensors; cognitive radio; online learning; sensing order;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2011 International Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-4577-1827-4

Type :

conf

DOI :

10.1109/CyberC.2011.91

Filename :

6079439

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2322900