• DocumentCode
    358162
  • Title

    An approach to on-line predictive detection

  • Author

    Zhang, Fan ; Hellerstein, Joseph L.

  • Author_Institution
    Dept. of Ind. Eng. & Oper. Res., Columbia Univ., New York, NY, USA
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    549
  • Lastpage
    556
  • Abstract
    Predicting network performance problems enables network operators to take corrective actions in advance of service disruptions. Typically, service problems are detected by tests that compare a metric (e.g., response time) to a threshold. The authors present an online algorithm for predicting the probability of threshold violations over a time horizon. The algorithm uses two cascaded submodels. The first removes non-stationarities by employing a discrete time Kalman filter in combination with analysis of variance. We derive parameters of the Kalman filter from differential equations that describe characteristics of the data. The second submodel estimates the probability of threshold violations by using a second order autoregressive model in combination with change-point detection. Using data from a production Web server, we evaluate our approach and show that it produces average accuracies that are comparable to those of an offline algorithm. However, our online algorithm produces predictions with considerably smaller variances. Further advantages of our approach are: (a) requiring much less data than the offline technique, one day versus multiple months; and (b) adapting to changes in the system and workloads since parameters are estimated online
  • Keywords
    Kalman filters; autoregressive processes; computer network management; differential equations; discrete time filters; file servers; performance evaluation; analysis of variance; cascaded submodels; change-point detection; corrective actions; differential equations; discrete time Kalman filter; network operators; network performance problem prediction; non-stationarities; offline algorithm; offline technique; online algorithm; online predictive detection; parameter estimation; production Web server; response time; second order autoregressive model; service disruptions; service problems; threshold violations; time horizon; Analysis of variance; Delay; Differential equations; Kalman filters; Network servers; Predictive models; Rivers; Testing; Training data; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Modeling, Analysis and Simulation of Computer and Telecommunication Systems, 2000. Proceedings. 8th International Symposium on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1526-7539
  • Print_ISBN
    0-7695-0728-X
  • Type

    conf

  • DOI
    10.1109/MASCOT.2000.876583
  • Filename
    876583