Conjectural variations in multi-agent reinforcement learning for energy-efficient cognitive wireless mesh networks

Author

Chen, Xianfu ; Zhao, Zhifeng ; Zhang, Honggang ; Chen, Tao

Author_Institution

York-Zhejiang Lab. for Cognitive Radio & Green Commun., Zhejiang Univ., Hangzhou, China

fYear

2012

fDate

1-4 April 2012

Firstpage

820

Lastpage

825

Abstract

As energy saving and environmental protection become an inevitable trend, researchers need to shift their focus to “green” oriented architecture design. Recent advances in the area of cognitive radio (CR) have significant potential towards “green” communications. One of the critical challenges for operating CRs in a wireless mesh network is how to efficiently allocate transmission powers and frequency resource among the secondary users (SUs) while satisfying the quality-of-service constraints of primary users. Due to the SUs´ intelligent and selfish properties, this paper focuses on the non-cooperative spectrum sharing in cognitive wireless mesh networks formed by a number of clusters. In order to study the competition behaviors of SUs in a dynamic environment, the problem is modeled as a stochastic learning process. We first extend the single-agent reinforcement learning (RL) to a multi-user context, based on which a conjecture based multi-agent RL algorithm is proposed. A rational SU learns the optimal transmission strategy from the conjecture over the other SUs´ responses.

Keywords

carrier transmission on power lines; cognitive radio; energy conservation; environmental factors; learning (artificial intelligence); multi-agent systems; quality of service; radio spectrum management; stochastic processes; telecommunication computing; wireless mesh networks; SU intelligent properties; SU responses; SU selfish properties; cognitive radio; competition behaviors; conjectural variations; conjecture-based multiagent RL algorithm; dynamic environment; energy-efficient cognitive wireless mesh networks; frequency resource; green communications; green oriented architecture design; multiagent reinforcement learning; multiuser context; noncooperative spectrum sharing; operating CR; optimal transmission strategy; quality-of-service constraints; rational SU; secondary users; single-agent reinforcement learning; stochastic learning process; transmission powers allocation; Games; Heuristic algorithms; Interference; Learning; Receivers; Resource management; Signal to noise ratio;

fLanguage

English

Publisher

ieee

Conference_Titel

Wireless Communications and Networking Conference (WCNC), 2012 IEEE

Conference_Location

Shanghai

ISSN

1525-3511

Print_ISBN

978-1-4673-0436-8

Type

conf

DOI

10.1109/WCNC.2012.6214485

Filename

6214485