• DocumentCode
    1945656
  • Title

    Reinforcement learning in BitTorrent systems

  • Author

    Izhak-Ratzin, Rafit ; Park, Hyunggon ; Van der Schaar, Mihaela

  • Author_Institution
    Palo Alto Networks, Sunnyvale, CA, USA
  • fYear
    2011
  • fDate
    10-15 April 2011
  • Firstpage
    406
  • Lastpage
    410
  • Abstract
    In this paper, we propose a BitTorrent-like protocol that replaces the peer selection mechanisms in the regular BitTorrent protocol with a novel reinforcement learning based mechanism. The inherent operation of P2P systems, which involves repeated interactions among peers over a long time period, allows peers to efficiently identify free-riders as well as desirable collaborators by learning the behavior of their associated peers. Thus, it can help peers improve their download rates and discourage free-riding (FR), while improving fairness. We model the peers´ interactions in the BitTorrent-like network as a repeated interaction game, where we explicitly consider the strategic behavior of the peers. A peer that applies the reinforcement learning based mechanism uses a partial history of the observations on associated peers´ statistical reciprocal behaviors to determine its best responses and estimate the corresponding impact on its expected utility. The policy determines the peer´s resource reciprocations with other peers, which would maximize the peer´s long-term performance.
  • Keywords
    learning (artificial intelligence); peer-to-peer computing; protocols; P2P systems; bittorrent protocol; peer selection mechanisms; reinforcement learning; strategic behavior; Bandwidth; Games; History; Learning; Peer to peer computing; Protocols; Robustness; BitTorrent; P2P; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    INFOCOM, 2011 Proceedings IEEE
  • Conference_Location
    Shanghai
  • ISSN
    0743-166X
  • Print_ISBN
    978-1-4244-9919-9
  • Type

    conf

  • DOI
    10.1109/INFCOM.2011.5935192
  • Filename
    5935192