• DocumentCode
    2265863
  • Title

    Scalable analysis of network measurements with Hadoop and Pig

  • Author

    Samak, Taghrid ; Gunter, Daniel ; Hendrix, Valerie

  • Author_Institution
    Lawrence Berkeley Nat. Lab., Berkeley, CA, USA
  • fYear
    2012
  • fDate
    16-20 April 2012
  • Firstpage
    1254
  • Lastpage
    1259
  • Abstract
    The deployment of ubiquitous distributed monitoring infrastructure such as perfSONAR is greatly increasing the availability and quality of network performance data. Cross-cutting analyses are now possible that can detect anomalies and provide real-time automated alerts to network management services. However, scaling these analyses to the volumes of available data remains a difficult task. Although there is significant research into offline analysis techniques, most of these approaches do not address the systems and scalability issues. This work presents an analysis framework incorporating industry best-practices and tools to perform large-scale analyses. Our framework integrates the expressiveness of Pig, the scalability of Hadoop, and the analysis and visualization capabilities of R to achieve a significant increase in both speed and power of analysis. Evaluation of our framework on a large dataset of real measurements from perfSONAR demonstrate a large speedup and novel statistical capabilities.
  • Keywords
    computer network management; computer network performance evaluation; data visualisation; distributed databases; network operating systems; ubiquitous computing; Hadoop; Pig; anomaly detection; cross-cutting analysis; large-scale analysis; network management services; network measurements; network performance data availability; network performance data quality; offline analysis technique; perfSONAR; real-time automated alerts; scalability issues; ubiquitous distributed monitoring infrastructure deployment; visualization capabilities; Correlation; Delay; Equations; Histograms; Mathematical model; Receivers; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network Operations and Management Symposium (NOMS), 2012 IEEE
  • Conference_Location
    Maui, HI
  • ISSN
    1542-1201
  • Print_ISBN
    978-1-4673-0267-8
  • Electronic_ISBN
    1542-1201
  • Type

    conf

  • DOI
    10.1109/NOMS.2012.6212060
  • Filename
    6212060