• DocumentCode
    173015
  • Title

    Diagnosing Cloud Performance Anomalies Using Large Time Series Dataset Analysis

  • Author

    Jehangiri, Ali Imran ; Yahyapour, Ramin ; Wieder, Philipp ; Yaqub, Edwin ; Kuan Lu

  • Author_Institution
    Gesellschaft Fur Wissenschaftliche Datenverarbeitung mbH Gottingen (GWDG), Gottingen, Germany
  • fYear
    2014
  • fDate
    June 27 2014-July 2 2014
  • Firstpage
    930
  • Lastpage
    933
  • Abstract
    Virtualized Cloud platforms have become increasingly common and the number of online services hosted on these platforms is also increasing rapidly. A key problem faced by providers in managing these services is detecting the performance anomalies and adjusting resources accordingly. As online services generate a very large amount of monitored data in the form of time series, it becomes very difficult to process this complex data by traditional approaches. In this work, we present a novel distributed parallel approach for performance anomaly detection. We build upon Holt-Winters forecasting for automatic aberrant behavior detection in time series. First, we extend the technique to work with MapReduce paradigm. Next, we correlate the anomalous metrics with the target Service Level Objective (SLO) in order to locate the suspicious metrics. We implemented and evaluated our approach on a production Cloud encompassing IaaS and PaaS service models. Experimental results confirm that our approach is efficient and effective in capturing the metrics causing performance anomalies in large time series datasets.
  • Keywords
    cloud computing; data analysis; forecasting theory; parallel processing; quality of service; time series; Holt-Winters forecasting; IaaS service models; MapReduce paradigm; PaaS service models; SLO; automatic aberrant behavior detection; cloud performance anomaly diagnosis; distributed parallel approach; large time series dataset analysis; online services; production cloud; service level objective; virtualized cloud platforms; Cloud computing; Distributed databases; Measurement; Monitoring; Scalability; Time factors; Time series analysis; Analytics; Cloud; Distributed Time-Series database; Monitoring; Performance Diagnosis; QoS;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing (CLOUD), 2014 IEEE 7th International Conference on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4799-5062-1
  • Type

    conf

  • DOI
    10.1109/CLOUD.2014.129
  • Filename
    6973835