• DocumentCode
    3515642
  • Title

    System log pre-processing to improve failure prediction

  • Author

    Zheng, Ziming ; Lan, Zhiling ; Park, Byung H. ; Geist, Al

  • Author_Institution
    Illinois Inst. of Technol., Chicago, TN, USA
  • fYear
    2009
  • fDate
    June 29 2009-July 2 2009
  • Firstpage
    572
  • Lastpage
    577
  • Abstract
    Log preprocessing, a process applied on the raw log before applying a predictive method, is of paramount importance to failure prediction and diagnosis. While existing filtering methods have demonstrated good compression rate, they fail to preserve important failure patterns that are crucial for failure analysis. To address the problem, in this paper we present a log preprocessing method. It consists of three integrated steps: (1) event categorization to uniformly classify system events and identify fatal events; (2) event filtering to remove temporal and spatial redundant records, while also preserving necessary failure patterns for failure analysis; (3) causality-related filtering to combine correlated events for filtering through apriori association rule mining. We demonstrate the effectiveness of our preprocessing method by using real failure logs collected from the Cray XT4 at ORNL and the Blue Gene/L system at SDSC. Experiments show that our method can preserve more failure patterns for failure analysis, thereby improving failure prediction by up to 174%.
  • Keywords
    data mining; fault tolerant computing; apriori association rule mining; causality-related filtering; event categorization; event filtering; failure analysis; failure logs; failure pattern; failure prediction; fatal event identification; spatial redundant records; system event classification; system log preprocessing; temporal redundant records; Association rules; Data analysis; Data mining; Failure analysis; Filtering; Information resources; Laboratories; Large-scale systems; Production systems; Productivity; Cray XT4; IBM Blue Gene/L; event categorization; event filtering; log preprocessing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Systems & Networks, 2009. DSN '09. IEEE/IFIP International Conference on
  • Conference_Location
    Lisbon
  • Print_ISBN
    978-1-4244-4422-9
  • Electronic_ISBN
    978-1-4244-4421-2
  • Type

    conf

  • DOI
    10.1109/DSN.2009.5270289
  • Filename
    5270289