• DocumentCode
    573840
  • Title

    Internet traffic clustering with constraints

  • Author

    Wang, Yu ; Xiang, Yang ; Zhang, Jun ; Yu, Shunzheng

  • Author_Institution
    Sch. of Inf. Technol., Deakin Univ., Melbourne, VIC, Australia
  • fYear
    2012
  • fDate
    27-31 Aug. 2012
  • Firstpage
    619
  • Lastpage
    624
  • Abstract
    Due to the limitations of the traditional port-based and payload-based traffic classification approaches, the past decade has seen extensive work on utilizing machine learning techniques to classify network traffic based on packet and flow level features. In particular, previous studies have shown that the unsupervised clustering approach is both accurate and capable of discovering previously unknown application classes. In this paper, we explore the utility of side information in the process of traffic clustering. Specifically, we focus on the flow correlation information that can be efficiently extracted from packet headers and expressed as instance-level constraints, which indicate that particular sets of flows are using the same application and thus should be put into the same cluster. To incorporate the constraints, we propose a modified constrained K-Means algorithm. A variety of real-world traffic traces are used to show that the constraints are widely available. The experimental results indicate that the constrained approach not only improves the quality of the resulted clusters, but also speeds up the convergence of the clustering process.
  • Keywords
    Internet; pattern classification; pattern clustering; telecommunication traffic; unsupervised learning; Internet traffic clustering; flow level features; instance-level constraints; machine learning techniques; modified constrained k-mean algorithm; network traffic classification; packet headers; packet level features; payload-based traffic classification approach; port-based traffic classification approach; traffic clustering process; unsupervised clustering approach; Accuracy; Classification algorithms; Clustering algorithms; IP networks; Internet; Payloads; Protocols; constrained clustering; constraints; machine learning; semi-supervised learning; traffic classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless Communications and Mobile Computing Conference (IWCMC), 2012 8th International
  • Conference_Location
    Limassol
  • Print_ISBN
    978-1-4577-1378-1
  • Type

    conf

  • DOI
    10.1109/IWCMC.2012.6314275
  • Filename
    6314275