DocumentCode :
17825
Title :
Channel Selection for Network-Assisted D2D Communication via No-Regret Bandit Learning With Calibrated Forecasting
Author :
Maghsudi, Setareh ; Stanczak, Slawomir
Author_Institution :
Commun. & Inf. Theor. Group, Tech. Univ. of Berlin, Berlin, Germany
Volume :
14
Issue :
3
fYear :
2015
fDate :
Mar-15
Firstpage :
1309
Lastpage :
1322
Abstract :
We consider the distributed channel selection problem in the context of device-to-device (D2D) communication as an underlay to a cellular network. Underlaid D2D users communicate directly by utilizing the cellular spectrum, but their decisions are not governed by any centralized controller. Selfish D2D users that compete for access to the resources form a distributed system where the transmission performance depends on channel availability and quality. This information, however, is difficult to acquire. Moreover, the adverse effects of D2D users on cellular transmissions should be minimized. In order to overcome these limitations, we propose a network-assisted distributed channel selection approach in which D2D users are only allowed to use vacant cellular channels. This scenario is modeled as a multi-player multi-armed bandit game with side information, for which a distributed algorithmic solution is proposed. The solution is a combination of no-regret learning and calibrated forecasting, and can be applied to a broad class of multi-player stochastic learning problems, in addition to the formulated channel selection problem. Theoretical analysis shows that the proposed approach not only yields vanishing regret in comparison to the global optimal solution but also guarantees that the empirical joint frequencies of the game converge to the set of correlated equilibria.
Keywords :
calibration; cellular radio; distributed algorithms; forecasting theory; learning (artificial intelligence); calibrated forecasting; cellular channel; cellular network; cellular spectrum; cellular transmission performance; channel availability; channel quality; device-to-device communication; distributed algorithmic solution; formulated channel selection problem; multiplayer multiarmed bandit game; multiplayer stochastic learning problem; network-assisted D2D communication; network-assisted distributed channel selection approach; no-regret bandit learning; Availability; Calibration; Forecasting; Games; Interference; Joints; Wireless communication; Calibrated forecaster; channel selection; correlated equilibrium; learning; underlay device-to-device communication;
fLanguage :
English
Journal_Title :
Wireless Communications, IEEE Transactions on
Publisher :
ieee
ISSN :
1536-1276
Type :
jour
DOI :
10.1109/TWC.2014.2365803
Filename :
6939716
Link To Document :
بازگشت