• DocumentCode
    3528961
  • Title

    Generalization performance analysis of flow-based peer-to-peer traffic identification

  • Author

    Wang, Yi-Hsien ; Gau, Victor ; Bosaw, Trevor ; Hwang, Jenq-Neng ; Lippman, Alan ; Lieberman, D. ; Wu, I-Chen

  • Author_Institution
    Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu
  • fYear
    2008
  • fDate
    16-19 Oct. 2008
  • Firstpage
    267
  • Lastpage
    272
  • Abstract
    In this paper, we develop a peer-to-peer (P2P) traffic identifier to facilitate quality of service (QoS) control in edge routers. Currently, since P2P applications consume a great percentage of Internet bandwidth, certain network optimization strategies are needed to improve the network performance. Traffic identification is the most important component that could be adopted in these optimization strategies. In this paper, we focus on developing a machine learning strategy to perform quick identification, and continuous tracking of flows associated with various P2P media streaming and file sharing applications. With the use of Random Forests (RF) and evaluated by using 10-fold cross validation, our method achieves greater than 98% accuracy rate and 89% precision rate of identifying the P2P flows, with less than 1% false positive rate. With the help of winner-take-all strategy, the generalization performance of using the RF built with data collected from one network to classify flows in other networks can achieve accuracy of being over 97%, with the precision being over 81% and the FP rate being below 2%.
  • Keywords
    learning (artificial intelligence); peer-to-peer computing; quality of service; random processes; telecommunication traffic; P2P media streaming; P2P traffic identifier; QoS control; file sharing; flow-based peer-to-peer traffic identification; generalization performance analysis; machine learning; quality of service; random forest; winner-take-all strategy; Bandwidth; Communication system traffic control; IP networks; Machine learning; Peer to peer computing; Performance analysis; Quality of service; Radio frequency; Radiofrequency identification; Streaming media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2008. MLSP 2008. IEEE Workshop on
  • Conference_Location
    Cancun
  • ISSN
    1551-2541
  • Print_ISBN
    978-1-4244-2375-0
  • Electronic_ISBN
    1551-2541
  • Type

    conf

  • DOI
    10.1109/MLSP.2008.4685491
  • Filename
    4685491