DocumentCode
2335136
Title
SPCp1-08: Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint
Author
Djonin, Dejan V. ; Krishnamurthy, Vikram
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of British Columbia, Vancouver, BC
fYear
2006
fDate
Nov. 27 2006-Dec. 1 2006
Firstpage
1
Lastpage
5
Abstract
This paper addresses learning based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as constrained Markov decision process with the goal of minimizing the average transmission cost (such as the transmission power) with the constraint on the average holding cost (such as the transmitter delay). Standard Q-learning algorithm is employed to adaptively find the optimal policy for unknown channel/traffic statistics, its convergence properties discussed and shown that it can relatively quickly compute the optimal policy even for rather large state spaces. In order to further improve the convergence rate of the standard Q- learning, we establish several structural results on the optimal policies. We show that the optimal transmission policy is monotonic in the buffer occupancy. This permits us to utilize the supermodularity of the Q-factors and form a structured Q-learning algorithm that increases the convergence rate with respect to the standard Q-learning algorithm.
Keywords
MIMO communication; Markov processes; Q-factor; fading channels; telecommunication network management; MIMO fading channels; Markovian fading; Q-factors; Q-learning algorithm; adaptive resource allocation; average transmission cost; constrained Markov decision process; delay constraint; transmission control policies; Adaptive control; Convergence; Costs; Delay; Fading; MIMO; Programmable control; Resource management; Statistics; Transmitters;
fLanguage
English
Publisher
ieee
Conference_Titel
Global Telecommunications Conference, 2006. GLOBECOM '06. IEEE
Conference_Location
San Francisco, CA
ISSN
1930-529X
Print_ISBN
1-4244-0356-1
Electronic_ISBN
1930-529X
Type
conf
DOI
10.1109/GLOCOM.2006.610
Filename
4151240
Link To Document