DocumentCode :
2015864
Title :
Traffic anomaly detection based on the IP size distribution
Author :
Soldo, Fabio ; Metwally, Ahmed
Author_Institution :
Google Inc., Mountain View, CA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
2005
Lastpage :
2013
Abstract :
In this paper we present a data-driven framework for detecting machine-generated traffic based on the IP size, i.e., the number of users sharing the same source IP. Our main observation is that diverse machine-generated traffic attacks share a common characteristic: they induce an anomalous deviation from the expected IP size distribution. We develop a principled framework that automatically detects and classifies these deviations using statistical tests and ensemble learning. We evaluate our approach on a massive dataset collected at Google for 90 consecutive days. We argue that our approach combines desirable characteristics: it can accurately detect fraudulent machine-generated traffic; it is based on a fundamental characteristic of these attacks and is thus robust (e.g., to DHCP re-assignment) and hard to evade; it has low complexity and is easy to parallelize, making it suitable for large-scale detection; and finally, it does not entail profiling users, but leverages only aggregate statistics of network traffic.
Keywords :
IP networks; learning (artificial intelligence); statistical testing; telecommunication security; telecommunication traffic; DHCP reassignment; Google; IP size distribution; anomalous deviation; data-driven framework; ensemble learning; fraudulent machine-generated traffic attack detection; network traffic; statistical tests; traffic anomaly detection; Advertising; Aggregates; Bismuth; Electronic mail; Google; Histograms; IP networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
INFOCOM, 2012 Proceedings IEEE
Conference_Location :
Orlando, FL
ISSN :
0743-166X
Print_ISBN :
978-1-4673-0773-4
Type :
conf
DOI :
10.1109/INFCOM.2012.6195581
Filename :
6195581
Link To Document :
بازگشت