Title :
A new re-sampling method for network traffic classification using SML
Author :
Ruoyu, Wang ; Zhen, Liu ; Ling, Zhang
Author_Institution :
Network Engineering and Research Center, South China University of Technology, Guangzhou, China
Abstract :
The way of internet traffic classification using machine learning has been a hot topic for a long time as it is independent on the packet payloads. However, the problems of classifier biasing towards the majority classes have not been solved effectively now. The uniform sampling is a popular technique to alleviate the data skew in machine learning traffic classification. But the original traffic data distribution would be destroyed by it. A new re-sampling method named tuning sampling for supervised machine learning (SML) is proposed to ease the problem of data skew in internet traffic classification. And it is compared with uniform sampling and stratified sampling methods using C4.5 classification algorithm. Our experimental results indicate that the classifier using tuning sampling gets the accuracy of minority classes are higher than the results of stratified sampling and the overall accuracy is higher than the result of uniform sampling.
Keywords :
Accuracy; Classification algorithms; Internet; Servers; Training; Tuning; World Wide Web; Data skew; Machine learning; Network Traffic classification;
Conference_Titel :
Information Science and Engineering (ICISE), 2010 2nd International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4244-7616-9
DOI :
10.1109/ICISE.2010.5688893