• DocumentCode
    573841
  • Title

    Classification of content and users in BitTorrent by semi-supervised learning methods

  • Author

    Avrachenkov, Konstantin ; Gonçalves, Paulo ; Legout, Arnaud ; Sokol, Marina

  • Author_Institution
    INRIA Sophia Antipolis, Sophia Antipolis, France
  • fYear
    2012
  • fDate
    27-31 Aug. 2012
  • Firstpage
    625
  • Lastpage
    630
  • Abstract
    P2P downloads still represent a large portion of today´s Internet traffic. More than 100 million users operate BitTorrent and generate more than 30% of the total Internet traffic. Recently, a significant research effort has been done to develop tools for automatic classification of Internet traffic by application. The purpose of the present work is to provide a framework for subclassification of P2P traffic generated by the BitTorrent protocol. The general intuition is that the users with similar interests download similar contents. This intuition can be rigorously formalized with the help of graph based semi-supervised learning approach. We have chosen to work with a PageRank based semi-supervised learning method, which scales well with very large volumes of data. We provide recommendations for the choice of parameters in the PageRank based semi-supervised learning method. In particular, we show that it is advantageous to choose labelled points with large PageRank score.
  • Keywords
    Internet; graph theory; learning (artificial intelligence); peer-to-peer computing; protocols; telecommunication traffic; BitTorrent protocol; Internet traffic; P2P downloads; P2P traffic; PageRank based semi-supervised learning method; automatic classification; content classification; graph based semi-supervised learning approach; Accuracy; Internet; Motion pictures; Noise; Semisupervised learning; Supervised learning; Tutorials;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless Communications and Mobile Computing Conference (IWCMC), 2012 8th International
  • Conference_Location
    Limassol
  • Print_ISBN
    978-1-4577-1378-1
  • Type

    conf

  • DOI
    10.1109/IWCMC.2012.6314276
  • Filename
    6314276