DocumentCode :
1395512
Title :
Queue-Aware Dynamic Clustering and Power Allocation for Network MIMO Systems via Distributed Stochastic Learning
Author :
Cui, Ying ; Huang, Qingqing ; Lau, Vincent K N
Author_Institution :
ECE Dept., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Volume :
59
Issue :
3
fYear :
2011
fDate :
3/1/2011 12:00:00 AM
Firstpage :
1229
Lastpage :
1238
Abstract :
In this paper, we propose a two-timescale delay-optimal dynamic clustering and power allocation design for downlink network MIMO systems. The dynamic clustering control is adaptive to the global queue state information (GQSI) only and computed at the base station controller (BSC) over a longer time scale. On the other hand, the power allocations of all the BSs in each cluster are adaptive to both intracluster channel state information (CCSI) and intracluster queue state information (CQSI), and computed at each cluster manager (CM) over a shorter time scale. We show that the two-timescale delay-optimal control can be formulated as an infinite-horizon average cost constrained partially observed Markov decision process (CPOMDP). By exploiting the special problem structure, we derive an equivalent Bellman equation in terms of pattern selection Q-factor to solve the CPOMDP. To address the distributed requirement and computational complexity, we approximate the pattern selection Q-factor by the sum of per-cluster potential functions and propose a novel distributed online learning algorithm to estimate them distributedly. We show that the proposed distributed online learning algorithm converges almost surely. By exploiting the birth-death structure of the queue dynamics, we further decompose the per-cluster potential function into the sum of per-cluster per-user potential functions and formulate the instantaneous power allocation as a per-stage QSI-aware interference game played among all the CMs. The proposed QSI-aware simultaneous iterative water-filling algorithm (QSIWFA) is shown to achieve the Nash equilibrium (NE).
Keywords :
MIMO communication; Markov processes; delay systems; game theory; infinite horizon; optimal control; queueing theory; radiofrequency interference; telecommunication congestion control; Bellman equation; CCSI; CPOMDP; CQSI; GQSI; Nash equilibrium; QSI-aware interference game; QSI-aware simultaneous iterative water-filling algorithm; QSIWFA; constrained partially observed Markov decision process; delay-optimal control; delay-optimal dynamic clustering; distributed online learning algorithm; distributed stochastic learning; global queue state information; infinite-horizon average cost; intracluster channel state information; intracluster queue state information; network MIMO system; pattern selection Q-factor; per-cluster per-user potential function; per-cluster potential function; power allocation; queue-aware dynamic clustering; Delay-optimal; distributed stochastic learning; dynamic clustering; interference game; network MIMO;
fLanguage :
English
Journal_Title :
Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1053-587X
Type :
jour
DOI :
10.1109/TSP.2010.2097253
Filename :
5658162
Link To Document :
بازگشت