DocumentCode
3515642
Title
System log pre-processing to improve failure prediction
Author
Zheng, Ziming ; Lan, Zhiling ; Park, Byung H. ; Geist, Al
Author_Institution
Illinois Inst. of Technol., Chicago, IL, USA
fYear
2009
fDate
June 29 2009-July 2 2009
Firstpage
572
Lastpage
577
Abstract
Log preprocessing, the processing applied to a raw log before a predictive method is run, is of paramount importance to failure prediction and diagnosis. While existing filtering methods have demonstrated good compression rates, they fail to preserve important failure patterns that are crucial for failure analysis. To address this problem, we present a log preprocessing method consisting of three integrated steps: (1) event categorization, to uniformly classify system events and identify fatal events; (2) event filtering, to remove temporally and spatially redundant records while preserving the failure patterns needed for failure analysis; and (3) causality-related filtering, to combine correlated events for filtering through Apriori association rule mining. We demonstrate the effectiveness of the method using real failure logs collected from the Cray XT4 at ORNL and the Blue Gene/L system at SDSC. Experiments show that our method preserves more failure patterns for failure analysis, thereby improving failure prediction by up to 174%.
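To make step (2) concrete, the following is a minimal sketch of threshold-based temporal and spatial filtering in Python. It is not the authors' algorithm: the abstract does not specify how redundancy is detected or how failure patterns are preserved, so the `Event` fields, the `filter_redundant` function, the single `threshold` parameter, and the sliding-window rule are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Event:
    """Hypothetical log record; field names are illustrative assumptions."""
    time: float      # seconds since an arbitrary epoch
    location: str    # node or component that reported the event
    category: str    # label assigned by the event categorization step


def filter_redundant(events: List[Event], threshold: float) -> List[Event]:
    """Drop records that repeat a category within `threshold` seconds.

    A repeat from the same location is treated as temporal redundancy,
    and a repeat from another location as spatial redundancy. Both cases
    are covered here by tracking when each category was last seen,
    regardless of location (an assumed simplification).
    """
    last_seen: Dict[str, float] = {}
    kept: List[Event] = []
    for ev in sorted(events, key=lambda e: e.time):
        prev = last_seen.get(ev.category)
        # Keep the first record of a category, or one arriving after a quiet gap.
        if prev is None or ev.time - prev > threshold:
            kept.append(ev)
        # Sliding window: every record, kept or not, refreshes the timestamp.
        last_seen[ev.category] = ev.time
    return kept


sample = [
    Event(0, "n1", "mem"),
    Event(10, "n1", "mem"),   # temporal repeat of the first record
    Event(15, "n3", "net"),
    Event(20, "n2", "mem"),   # spatial repeat from another node
    Event(500, "n1", "mem"),  # arrives after a long gap, so it is kept
]
filtered = filter_redundant(sample, threshold=60)
```

With a 60-second threshold, `filtered` holds the records at times 0, 15, and 500. A plain filter like this is the kind of baseline the abstract criticizes, since it can discard records that belong to a failure pattern; the method described above adds pattern preservation on top of such filtering.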
Keywords
data mining; fault tolerant computing; apriori association rule mining; causality-related filtering; event categorization; event filtering; failure analysis; failure logs; failure pattern; failure prediction; fatal event identification; spatial redundant records; system event classification; system log preprocessing; temporal redundant records; Association rules; Data analysis; Data mining; Failure analysis; Filtering; Information resources; Laboratories; Large-scale systems; Production systems; Productivity; Cray XT4; IBM Blue Gene/L; event categorization; event filtering; log preprocessing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Dependable Systems & Networks, 2009. DSN '09. IEEE/IFIP International Conference on
Conference_Location
Lisbon
Print_ISBN
978-1-4244-4422-9
Electronic_ISBN
978-1-4244-4421-2
Type
conf
DOI
10.1109/DSN.2009.5270289
Filename
5270289