• DocumentCode
    1809678
  • Title

    Information sharing in distributed stochastic bandits

  • Author

    Buccapatnam, Swapna ; Jian Tan ; Li Zhang

  • Author_Institution
    Dept. of Electr. Eng., Princeton Univ., Princeton, NJ, USA
  • fYear
    2015
  • fDate
    April 26 2015-May 1 2015
  • Firstpage
    2605
  • Lastpage
    2613
  • Abstract
    Information sharing is an important issue for stochastic bandit problems in a distributed setting. Consider N players dealing with the same multi-armed bandit problem. All players receive requests simultaneously and must choose one of M actions for each request. Sharing information among these N players can decrease the regret for each of them but also incurs cooperation and communication overhead. In this setting, we study how cooperation and communication can impact the system performance measured by regret and communication cost. For both scenarios, we establish a uniform lower bound to the regret for the entire system as a function of time and network size. Concerning cooperation, we study the problem from a game-theoretic perspective. When each player´s actions and payoffs are immediately visible to all others, we identify strategies for all players under which co-operative exploration is ensured. Regarding the communication cost, we consider incomplete information sharing such that a player´s payoffs and actions are not entirely available to others. The players communicate observations to each other to reduce their regret, however with a cost. We show that a logarithmic communication cost is necessary to achieve the optimal regret. For Bernoulli arrivals, we specify a policy that achieves the optimal regret with a logarithmic communication cost. Our work opens a novel direction towards understanding information sharing for active learning in a distributed environment.
  • Keywords
    game theory; statistical distributions; Bernoulli arrivals; distributed stochastic bandit; game-theoretic perspective; information sharing; logarithmic communication cost; multiarmed bandit problem; Computers; Conferences; Games; Information management; Monitoring; Nash equilibrium; Probability distribution;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communications (INFOCOM), 2015 IEEE Conference on
  • Conference_Location
    Kowloon
  • Type

    conf

  • DOI
    10.1109/INFOCOM.2015.7218651
  • Filename
    7218651