• DocumentCode
    2089131
  • Title

    Internet Traffic Classification Using Machine Learning: A Token-based Approach

  • Author

    Wang, Yu ; Xiang, Yang ; Yu, Shunzheng

  • Author_Institution
    Sch. of Inf. Technol., Deakin Univ., Melbourne, VIC, Australia
  • fYear
    2011
  • fDate
    24-26 Aug. 2011
  • Firstpage
    285
  • Lastpage
    289
  • Abstract
    Due to the increasing unreliability of traditional port-based methods, Internet traffic classification has attracted a lot of research efforts in recent years. Quite a lot of previous papers have focused on using statistical characteristics as discriminators and applying machine learning techniques to classify the traffic flows. In this paper, we propose a novel machine learning based approach where the features are extracted from packet payload instead of flow statistics. Specifically, every flow is represented by a feature vector, in which each item indicates the occurrence of a particular token, i.e., a common substring, in the payload. We have applied various machine learning algorithms to evaluate the idea and used different feature selection schemes to identify the critical tokens. Experimental result based on a real-world traffic data set shows that the approach can achieve high accuracy with low overhead.
  • Keywords
    Internet; learning (artificial intelligence); pattern classification; Internet traffic classification; feature selection scheme; flow statistics; machine learning; packet payload; token-based approach; Bayesian methods; Classification algorithms; Internet; Machine learning; Payloads; Protocols; Training; Internet traffic classification; common substrings; feature selection; machine learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Science and Engineering (CSE), 2011 IEEE 14th International Conference on
  • Conference_Location
    Dalian, Liaoning
  • Print_ISBN
    978-1-4577-0974-6
  • Type

    conf

  • DOI
    10.1109/CSE.2011.58
  • Filename
    6062887