DocumentCode
155974
Title
Stability of flow features for the identification of Internet applications
Author
Oliveira, M. Rosario ; Valadas, Rui ; Pietrzyk, Marcin ; Collange, Denis
Author_Institution
Dept. de Mat., Univ. de Lisboa, Lisbon, Portugal
fYear
2014
fDate
17-19 Sept. 2014
Firstpage
1
Lastpage
6
Abstract
One important requirement associated with the deployment of large scale classification infrastructures is the portability of classifiers, which allows a small number of pre-trained classifiers to be used on many sites and time periods. The portability can be severely degraded if the flow features used in the classification process lack stability, i.e. if they do not preserve their most relevant statistical properties across different sites and time periods. In this paper we propose a statistical procedure to evaluate the stability of flow features, which resorts to the notion of effect size. The procedure is used challenge the stability of popular flow features, such as the direction and size of the first four packets of a TCP connection. Our results, obtained with three high-quality traffic traces, clearly show that only some applications are portable, when using these features as discriminators. We also provide evidence of these findings based on the operation of the protocols underlying the Internet applications.
Keywords
Internet; statistical analysis; transport protocols; Internet identification; TCP connection; classification process; classifier portability; flow feature stability; high-quality traffic traces; large scale classification infrastructures; pre-trained classifiers; protocols; statistical procedure; statistical properties; Analysis of variance; Internet; Postal services; Protocols; Sociology; Stability analysis; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunications Network Strategy and Planning Symposium (Networks), 2014 16th International
Conference_Location
Funchal
Type
conf
DOI
10.1109/NETWKS.2014.6959223
Filename
6959223
Link To Document