• DocumentCode
    968000
  • Title

    A Best Practice Guide to Resource Forecasting for Computing Systems

  • Author

    Hoffmann, Guenther A. ; Trivedi, Kishor S. ; Malek, Miroslaw

  • Author_Institution
    Duke Univ., Durham
  • Volume
    56
  • Issue
    4
  • fYear
    2007
  • Firstpage
    615
  • Lastpage
    628
  • Abstract
    Recently, measurement-based studies of software systems have proliferated, reflecting an increasingly empirical focus on system availability, reliability, aging, and fault tolerance. However, it is a nontrivial, error-prone, arduous, and time-consuming task even for experienced system administrators, and statistical analysts to know what a reasonable set of steps should include to model, and successfully predict performance variables, or system failures of a complex software system. Reported results are fragmented, and focus on applying statistical regression techniques to monitored numerical system data. In this paper, we propose a best practice guide for building empirical models based on our experience with forecasting Apache web server performance variables, and forecasting call availability of a real-world telecommunication system. To substantiate the presented guide, and to demonstrate our approach in a step by step manner, we model, and predict the response time, and the amount of free physical memory of an Apache web server system, as well as the call availability of an industrial telecommunication system. Additionally, we present concrete results for a) variable selection where we cross benchmark three procedures, b) empirical model building where we cross benchmark four techniques, and c) sensitivity analysis. This best practice guide intends to assist in configuring modeling approaches systematically for best estimation, and prediction results.
  • Keywords
    Internet; fault tolerant computing; file servers; large-scale systems; regression analysis; software reliability; Apache Web server performance; complex software system; computing systems; empirical models; fault tolerance; industrial telecommunication system; measurement-based studies; real-world telecommunication system; resource forecasting; software systems; statistical analysts; statistical regression techniques; system administrators; system availability; system reliability; Aging; Availability; Best practices; Failure analysis; Fault tolerant systems; Performance analysis; Predictive models; Software measurement; Software systems; Web server; Apache web server; failure forecasting; monitoring; non-parametric modeling; prediction of resource utilization; quantitative analysis; statistical modeling; telecommunication systems;
  • fLanguage
    English
  • Journal_Title
    Reliability, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9529
  • Type

    jour

  • DOI
    10.1109/TR.2007.909764
  • Filename
    4378407