• DocumentCode
    2074046
  • Title

    Multi-armed bandit based opportunistic channel access: A consideration of switch cost

  • Author

    Jing Huang ; Xiaoying Gan ; Xinxin Feng

  • Author_Institution
    Dept. of Electron. Eng., Shanghai Jiao Tong Univ., Shanghai, China
  • fYear
    2013
  • fDate
    9-13 June 2013
  • Firstpage
    1651
  • Lastpage
    1655
  • Abstract
    In this paper, we study on the problem of opportunistic channel access without prior information about channels. We model it as the multi-armed bandit (MAB) problem. There are N independent arms. The player can choose one arm to play each time and get a reward. Switch cost is taken into consideration when player switches arm. Switch cost includes reward loss and switch delay. The concept of regret is used to measure the performance of an access policy. We prove that the regret of Lai-Robbins policy with switch cost grows with time at logarithmic order as that without switch cost, though with a much higher leading constant. Then we propose a policy referred as reducing switch with advanced play (RSAP), whose regret is shown to grow with time at logarithmic order with a much smaller leading constant.
  • Keywords
    multi-access systems; radio spectrum management; telecommunication switching; wireless channels; Lai-Robbins policy; MAB; RSAP; access policy performance measurement; independent arms; logarithmic order; multiarmed bandit based opportunistic channel access; reducing switch-with-advanced play; reward loss; spectrum under-utilization; static radio spectrum allocation policy; switch cost; switch delay; Ad hoc networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (ICC), 2013 IEEE International Conference on
  • Conference_Location
    Budapest
  • ISSN
    1550-3607
  • Type

    conf

  • DOI
    10.1109/ICC.2013.6654753
  • Filename
    6654753