DocumentCode
2074046
Title
Multi-armed bandit based opportunistic channel access: A consideration of switch cost
Author
Jing Huang ; Xiaoying Gan ; Xinxin Feng
Author_Institution
Dept. of Electron. Eng., Shanghai Jiao Tong Univ., Shanghai, China
fYear
2013
fDate
9-13 June 2013
Firstpage
1651
Lastpage
1655
Abstract
In this paper, we study on the problem of opportunistic channel access without prior information about channels. We model it as the multi-armed bandit (MAB) problem. There are N independent arms. The player can choose one arm to play each time and get a reward. Switch cost is taken into consideration when player switches arm. Switch cost includes reward loss and switch delay. The concept of regret is used to measure the performance of an access policy. We prove that the regret of Lai-Robbins policy with switch cost grows with time at logarithmic order as that without switch cost, though with a much higher leading constant. Then we propose a policy referred as reducing switch with advanced play (RSAP), whose regret is shown to grow with time at logarithmic order with a much smaller leading constant.
Keywords
multi-access systems; radio spectrum management; telecommunication switching; wireless channels; Lai-Robbins policy; MAB; RSAP; access policy performance measurement; independent arms; logarithmic order; multiarmed bandit based opportunistic channel access; reducing switch-with-advanced play; reward loss; spectrum under-utilization; static radio spectrum allocation policy; switch cost; switch delay; Ad hoc networks;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications (ICC), 2013 IEEE International Conference on
Conference_Location
Budapest
ISSN
1550-3607
Type
conf
DOI
10.1109/ICC.2013.6654753
Filename
6654753
Link To Document