Title :
Improving Performance of Network Traffic Classification Systems by Cleaning Training Data
Author :
Gargiulo, Francesco ; Sansone, Carlo
Author_Institution :
Dipt. di Inf. e Sist., Univ. degli Studi di Napoli Federico II, Naples, Italy
Abstract :
In this paper we propose applying an algorithm that finds and cleans mislabeled training samples in an adversarial learning context, in which a malicious user tries to camouflage training patterns in order to limit the performance of the classification system. In particular, we describe how this algorithm can be effectively applied to the problem of identifying HTTP traffic flowing through TCP port 80, where mislabeled samples can be forced by means of port-spoofing attacks.
Keywords :
Internet; learning (artificial intelligence); pattern classification; security of data; HTTP traffic identification; TCP 80 port; adversarial learning context; mislabeled training sample cleaning; network traffic classification systems; port-spoofing attacks; training data cleaning; Accuracy; Cleaning; Context; Decision trees; Protocols; Training; Training data; Adversarial learning; Data Cleaning; Network Traffic Classification
Conference_Title :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.678