DocumentCode
2295968
Title
Conjectural variations in multi-agent reinforcement learning for energy-efficient cognitive wireless mesh networks
Author
Chen, Xianfu ; Zhao, Zhifeng ; Zhang, Honggang ; Chen, Tao
Author_Institution
York-Zhejiang Lab. for Cognitive Radio & Green Commun., Zhejiang Univ., Hangzhou, China
fYear
2012
fDate
1-4 April 2012
Firstpage
820
Lastpage
825
Abstract
As energy saving and environmental protection become an inevitable trend, researchers need to shift their focus to “green” oriented architecture design. Recent advances in the area of cognitive radio (CR) have significant potential towards “green” communications. One of the critical challenges for operating CRs in a wireless mesh network is how to efficiently allocate transmission powers and frequency resource among the secondary users (SUs) while satisfying the quality-of-service constraints of primary users. Due to the SUs´ intelligent and selfish properties, this paper focuses on the non-cooperative spectrum sharing in cognitive wireless mesh networks formed by a number of clusters. In order to study the competition behaviors of SUs in a dynamic environment, the problem is modeled as a stochastic learning process. We first extend the single-agent reinforcement learning (RL) to a multi-user context, based on which a conjecture based multi-agent RL algorithm is proposed. A rational SU learns the optimal transmission strategy from the conjecture over the other SUs´ responses.
Keywords
carrier transmission on power lines; cognitive radio; energy conservation; environmental factors; learning (artificial intelligence); multi-agent systems; quality of service; radio spectrum management; stochastic processes; telecommunication computing; wireless mesh networks; SU intelligent properties; SU responses; SU selfish properties; cognitive radio; competition behaviors; conjectural variations; conjecture-based multiagent RL algorithm; dynamic environment; energy-efficient cognitive wireless mesh networks; frequency resource; green communications; green oriented architecture design; multiagent reinforcement learning; multiuser context; noncooperative spectrum sharing; operating CR; optimal transmission strategy; quality-of-service constraints; rational SU; secondary users; single-agent reinforcement learning; stochastic learning process; transmission powers allocation; Games; Heuristic algorithms; Interference; Learning; Receivers; Resource management; Signal to noise ratio;
fLanguage
English
Publisher
ieee
Conference_Titel
Wireless Communications and Networking Conference (WCNC), 2012 IEEE
Conference_Location
Shanghai
ISSN
1525-3511
Print_ISBN
978-1-4673-0436-8
Type
conf
DOI
10.1109/WCNC.2012.6214485
Filename
6214485
Link To Document