DocumentCode
1542258
Title
Distributed Reinforcement Learning Frameworks for Cooperative Retransmission in Wireless Networks
Author
Naddafzadeh-Shirazi, Ghasem ; Kong, Peng-Yong ; Tham, Chen-Khong
Author_Institution
Inst. for Infocomm Res., Agency for Sci., Technol. & Res., Singapore, Singapore
Volume
59
Issue
8
fYear
2010
Firstpage
4157
Lastpage
4162
Abstract
We address the problem of cooperative retransmission in the media access control (MAC) layer of a distributed wireless network with spatial reuse, where there can be multiple concurrent transmissions from the source and relay nodes. We propose a novel Markov decision process (MDP) framework for adjusting the transmission powers and transmission probabilities in the source and relay nodes to achieve the highest network throughput per unit of consumed energy. We also propose distributed methods that avoid solving a centralized MDP model with a large number of states by employing model-free reinforcement learning (RL) algorithms. We show the convergence to a local solution and compute a lower bound for the performance of the proposed RL algorithms. We further empirically confirm that the proposed learning schemes are robust to collisions and are scalable with regard to the network size and can provide significant cooperative diversity while enjoying low complexity and fast convergence.
Keywords
Markov processes; access protocols; learning (artificial intelligence); radio networks; Markov decision process; concurrent transmissions; cooperative retransmission; distributed reinforcement learning; distributed wireless network; media access control; Councils; Electrical capacitance tomography; Frame relay; Helium; Learning; Materials science and technology; Permission; Robustness; Throughput; Wireless networks; Distributed Markov decision process (MDP) for wireless networks; media access control (MAC) cooperative retransmission; reinforcement learning (RL);
fLanguage
English
Journal_Title
Vehicular Technology, IEEE Transactions on
Publisher
ieee
ISSN
0018-9545
Type
jour
DOI
10.1109/TVT.2010.2059055
Filename
5512673
Link To Document