Title :
Learning from collisions in cognitive radio networks: Time Division Fair Sharing without pre-agreement
Author :
Liu, Keqin ; Zhao, Qing
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Davis, CA, USA
fDate :
Oct. 31 2010-Nov. 3 2010
Abstract :
We consider a decentralized multi-armed bandit problem arisen in the application of cognitive radio networks, where M distributed secondary users independently search for spectrum opportunities in N channels without information sharing. The channel occupancy is modeled as an i.i.d. Bernoulli process with unknown mean. A collision happens when multiple users choose the same channel, and none or only one receives reward depending on the collision model. Under a non-Bayesian formulation, the performance measure of a policy is given by the system regret, defied as the total reward loss with respect to the optimal performance in the ideal scenario where all channel parameters are known to all users and collisions among secondary users are eliminated through centralized scheduling. In our previous work, a Time Division Fair Sharing (TDFS) framework was proposed for constructing fair decentralized policies that achieve the same logarithmic order of the system regret growth rate as in the centralized counterpart where users exchange observations and make decisions jointly. This TDFS framework, however, requires pre-agreement among users regarding the offset in their time sharing schedule. In this work, we generalize the TDFS framework by eliminating the pre-agreement thus achieve a complete decentralization among users. We show that by learning from collisions, users can settle at orthogonal time-sharing offsets and the TDFS framework can maintain its logarithmic regret order and the fairness among users without pre-agreement. The result applies to general stochastic processes beyond Bernoulli and thus finds a wide range of applications including multi-channel communication systems, multi-agent systems, web search and advertising, and social networks.
Keywords :
cognitive radio; stochastic processes; TDFS framework; The channel occupancy; Web search; centralized scheduling; cognitive radio networks; collision learning; decentralized multiarmed bandit problem; i.i.d. Bernoulli process; logarithmic regret order; multiagent systems; multichannel communication systems; nonBayesian formulation; orthogonal time-sharing offsets; social networks; stochastic processes; time division fair sharing framework; Cognitive radio; Couplings; History; Lead; Loss measurement; Schedules; Sensors;
Conference_Titel :
MILITARY COMMUNICATIONS CONFERENCE, 2010 - MILCOM 2010
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-8178-1
DOI :
10.1109/MILCOM.2010.5680378