• DocumentCode
    3389537
  • Title

    IP traffic classification based on machine learning

  • Author

    Qin, Donghong ; Yang, Jiahai ; Wang, Jiamian ; Zhang, Bin

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2011
  • fDate
    25-28 Sept. 2011
  • Firstpage
    882
  • Lastpage
    886
  • Abstract
    With the rapid development of Internet, many network applications (e.g., P2P) use dynamic ports and encryption technology, which makes the traditional port and payload-based classification methods ineffective. Hence, it is important and necessary to find the more effective ones. Currently the machine learning (ML) techniques provide a promising alternative one for IP traffic classification. In this work, we use the ML-based classification method to identify the classes of the unknown flows using the payload-independent statistical features such as packet-length and arrival-interval. In order to improve the efficiency of the classification methods, the feature reduction techniques are further adopted to refine the selected features for attaining a best group of features. Finally we compare and evaluate the ML classification algorithms based on the BRASIL data source in terms of the three metrics such as overall accuracy, average precision and average recall. Our experiments show that the decision-tree algorithm is the best ML one for IP traffic classification and is able to construct the real-time classification system.
  • Keywords
    IP networks; Internet; decision trees; learning (artificial intelligence); peer-to-peer computing; real-time systems; statistical analysis; telecommunication traffic; BRASIL data source; IP traffic classification; Internet; P2P network; arrival-interval; decision-tree algorithm; dynamic ports; encryption technology; feature reduction techniques; machine learning techniques; packet- length; payload-based classification methods; payload-independent statistical features; realtime classification system; Accuracy; Algorithm design and analysis; Classification algorithms; Feature extraction; Measurement; Multimedia communication; Quality of service; IP traffic flow classification; ML algorithm; features optimization; performance evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication Technology (ICCT), 2011 IEEE 13th International Conference on
  • Conference_Location
    Jinan
  • Print_ISBN
    978-1-61284-306-3
  • Type

    conf

  • DOI
    10.1109/ICCT.2011.6158005
  • Filename
    6158005