DocumentCode :
2500125
Title :
Improving Performance of Network Traffic Classification Systems by Cleaning Training Data
Author :
Gargiulo, Francesco ; Sansone, Carlo
Author_Institution :
Dipt. di Inf. e Sist., Univ. degli Studi di Napoli Federico II, Naples, Italy
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
2768
Lastpage :
2771
Abstract :
In this paper we propose to apply an algorithm for finding out and cleaning mislabeled training sample in an adversarial learning context, in which a malicious user tries to camouflage training patterns in order to limit the classification system performance. In particular, we describe how this algorithm can be effectively applied to the problem of identifying HTTP traffic flowing through port TCP 80, where mislabeled samples can be forced by using port-spoofing attacks.
Keywords :
Internet; learning (artificial intelligence); pattern classification; security of data; HTTP traffic identification; TCP 80 port; adversarial learning context; mislabeled training sample cleaning; network traffic classification systems; port-spoofing attacks; training data cleaning; Accuracy; Cleaning; Context; Decision trees; Protocols; Training; Training data; Adversarial learning; Data Cleaning; Network Traffic Classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.678
Filename :
5597036
Link To Document :
بازگشت