مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning from collisions in cognitive radio networks: Time Division Fair Sharing without pre-agreement

DocumentCode :

1940722

Title :

Learning from collisions in cognitive radio networks: Time Division Fair Sharing without pre-agreement

Author :

Liu, Keqin ; Zhao, Qing

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of California, Davis, CA, USA

fYear :

2010

fDate :

Oct. 31 2010-Nov. 3 2010

Firstpage :

2262

Lastpage :

2267

Abstract :

We consider a decentralized multi-armed bandit problem arisen in the application of cognitive radio networks, where M distributed secondary users independently search for spectrum opportunities in N channels without information sharing. The channel occupancy is modeled as an i.i.d. Bernoulli process with unknown mean. A collision happens when multiple users choose the same channel, and none or only one receives reward depending on the collision model. Under a non-Bayesian formulation, the performance measure of a policy is given by the system regret, defied as the total reward loss with respect to the optimal performance in the ideal scenario where all channel parameters are known to all users and collisions among secondary users are eliminated through centralized scheduling. In our previous work, a Time Division Fair Sharing (TDFS) framework was proposed for constructing fair decentralized policies that achieve the same logarithmic order of the system regret growth rate as in the centralized counterpart where users exchange observations and make decisions jointly. This TDFS framework, however, requires pre-agreement among users regarding the offset in their time sharing schedule. In this work, we generalize the TDFS framework by eliminating the pre-agreement thus achieve a complete decentralization among users. We show that by learning from collisions, users can settle at orthogonal time-sharing offsets and the TDFS framework can maintain its logarithmic regret order and the fairness among users without pre-agreement. The result applies to general stochastic processes beyond Bernoulli and thus finds a wide range of applications including multi-channel communication systems, multi-agent systems, web search and advertising, and social networks.

Keywords :

cognitive radio; stochastic processes; TDFS framework; The channel occupancy; Web search; centralized scheduling; cognitive radio networks; collision learning; decentralized multiarmed bandit problem; i.i.d. Bernoulli process; logarithmic regret order; multiagent systems; multichannel communication systems; nonBayesian formulation; orthogonal time-sharing offsets; social networks; stochastic processes; time division fair sharing framework; Cognitive radio; Couplings; History; Lead; Loss measurement; Schedules; Sensors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

MILITARY COMMUNICATIONS CONFERENCE, 2010 - MILCOM 2010

Conference_Location :

San Jose, CA

ISSN :

2155-7578

Print_ISBN :

978-1-4244-8178-1

Type :

conf

DOI :

10.1109/MILCOM.2010.5680378

Filename :

5680378

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1940722