DocumentCode :
2220007
Title :
Is machine learning losing the battle to produce transportable signatures against VoIP traffic?
Author :
Alshammari, Riyad ; Zincir-Heywood, A. Nur
Author_Institution :
Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
fYear :
2011
fDate :
5-8 June 2011
Firstpage :
1543
Lastpage :
1550
Abstract :
Traffic classification becomes more challenging since the traditional techniques such as port numbers or deep packet inspection are ineffective against voice over IP (VoIP) applications, which uses non-standard ports and encryption. Statistical information based on network layer with the use of machine learning (ML) can achieve high classification accuracy and produce transportable signatures. However, the ability of ML to find transportable signatures depends mainly on the training data sets. In this paper, we explore the importance of sampling training data sets for the ML algorithms, specifically Genetic Programming, C5.0, Naive Bayesian and AdaBoost, to find transportable signatures. To this end, we employed two techniques for sampling network training data sets, namely random sampling and consecutive sampling. Results show that random sampling and 90-minute consecutive sampling have the best performance in terms of accuracy using C5.0 and SBB, respectively. In terms of complexity, the size of C5.0 solutions increases as the training size increases, whereas SBB finds simpler solutions.
Keywords :
Bayes methods; Internet telephony; genetic algorithms; learning (artificial intelligence); telecommunication security; telecommunication traffic; AdaBoost; C5.0; VoIP traffic classification; consecutive sampling; genetic programming; machine learning; naive Bayesian; random sampling; transportable signatures; voice over IP; Bayesian methods; Cryptography; Decision trees; Payloads; Protocols; Training; Training data; Classification; Encryption; Machine Learning; Sampling training data sets; Security; VoIP; transportability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2011 IEEE Congress on
Conference_Location :
New Orleans, LA
ISSN :
Pending
Print_ISBN :
978-1-4244-7834-7
Type :
conf
DOI :
10.1109/CEC.2011.5949799
Filename :
5949799
Link To Document :
بازگشت