Title :
On the use of Sub-Space Clustering & Evidence Accumulation for traffic analysis & classification
Author :
Casas, Pedro ; Mazel, Johan ; Owezarski, Philippe
Author_Institution :
LAAS, CNRS, Toulouse, France
Abstract :
Driven by the well-known limitations of port and payload-based analysis techniques, the use of Machine Learning for Internet traffic analysis and classification has become a fertile research area during the past half-decade. In this paper we introduce a novel unsupervised approach to identify different classes of IP flows sharing similar characteristics. The unsupervised analysis is accomplished by means of robust clustering techniques, using Sub-Space Clustering, Evidence Accumulation, and Hierarchical Clustering algorithms to explore inter-flows structure. Our approach permits to identify natural groupings of traffic flows, combining the evidence of data structure provided by different partitions of the same set of traffic flows. The technique is further used to build an automatic flow classification model, using a semi-supervised-learning-based approach. The approach uses only a reduced fraction of labeled flows to map the identified clusters into their associated most-probable originating application, which strongly simplifies its calibration. We evaluate the performance of our techniques using real traffic traces, additionally comparing their performance against previously proposed clustering-based classification methods.
Keywords :
Internet; learning (artificial intelligence); pattern clustering; performance evaluation; telecommunication traffic; IP flow; Internet traffic analysis; Internet traffic classification; calibration; evidence accumulation; hierarchical clustering algorithms; machine learning; performance evaluation; semisupervised-learning-based approach; subspace clustering; Accuracy; Algorithm design and analysis; Clustering algorithms; Partitioning algorithms; Robustness; Training; Vegetation; Evidence Accumulation; Hierarchical Clustering; Semi-Supervised Traffic Classification; Sub-Space Clustering; Unsupervised Traffic Analysis;
Conference_Titel :
Wireless Communications and Mobile Computing Conference (IWCMC), 2011 7th International
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-9539-9
DOI :
10.1109/IWCMC.2011.5982680