DocumentCode :
2668830
Title :
An Internet traffic analysis method with MapReduce
Author :
Lee, Youngseok ; Kang, Wonchul ; Son, Hyeongu
Author_Institution :
Chungnam Nat. Univ., Daejeon, South Korea
fYear :
2010
fDate :
19-23 April 2010
Firstpage :
357
Lastpage :
361
Abstract :
Internet traffic measurement and analysis have been usually performed on a high performance server that collects and examines packet or flow traces. However, when we monitor a large volume of traffic data for detailed statistics, a long-period or a large-scale network, it is not easy to handle Tera or Peta-byte traffic data with a single server. Common ways to reduce a large volume of continuously monitored traffic data are packet sampling or flow aggregation that results in coarse traffic statistics. As distributed parallel processing schemes have been recently developed due to the cloud computing platform and the cluster filesystem, they could be usefully applied to analyzing big traffic data. Thus, in this paper, we propose an Internet flow analysis method based on the MapReduce software framework of the cloud computing platform for a large-scale network. From the experiments with an open-source MapReduce system, Hadoop, we have verified that the MapReduce-based flow analysis method improves the flow statistics computation time by 72%, when compared with the popular flow data processing tool, flow-tools, on a single host. In addition, we showed that MapReduce-based programs complete the flow analysis job against a single node failure.
Keywords :
Internet; network servers; parallel processing; public domain software; statistical analysis; telecommunication traffic; Hadoop; Internet flow analysis method; Internet traffic measurement; MapReduce software framework; cloud computing platform; cluster filesystem; coarse traffic statistics; distributed parallel processing schemes; flow aggregation; flow statistics; flow traces; high performance server; open-source MapReduce system; packet sampling; packet traces; traffic data monitoring; Cloud computing; Fluid flow measurement; Internet; Large-scale systems; Monitoring; Network servers; Open source software; Performance analysis; Statistical distributions; Telecommunication traffic; Hadoop; MapReduce; NetFlow; cloud computing; traffic monitoring;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network Operations and Management Symposium Workshops (NOMS Wksps), 2010 IEEE/IFIP
Conference_Location :
Osaka
Print_ISBN :
978-1-4244-6037-3
Type :
conf
DOI :
10.1109/NOMSW.2010.5486551
Filename :
5486551
Link To Document :
بازگشت