DocumentCode :
2051806
Title :
Predicting failures of computer systems: a case study for a telecommunication system
Author :
Salfner, Felix ; Schieschke, Michael ; Malek, Miroslaw
Author_Institution :
Inst. für Informatik, Humboldt-Univ. zu Berlin, Germany
fYear :
2006
fDate :
25-29 April 2006
Abstract :
The goal of online failure prediction is to forecast imminent failures while the system is running. This paper compares similar events prediction (SEP) with two other well-known techniques for online failure prediction: a straightforward method based on a reliability model, and the dispersion frame technique (DFT). SEP recognizes failure-prone patterns using a semi-Markov chain in combination with clustering. We applied the approaches to real data from a commercial telecommunication system. Results are presented in terms of precision, recall, F-measure, and accumulated runtime cost. The results suggest that SEP significantly improves forecasting performance over the other two techniques.
Keywords :
Markov processes; pattern recognition; software fault tolerance; telecommunication; F-measure; computer system prediction failures; dispersion frame technique; failure-prone pattern recognition; online failure prediction; reliability model; semi-Markov chain; similar events prediction; telecommunication system; Availability; Computer aided software engineering; Fault tolerance; Hardware; Pattern recognition; Predictive models; Software design; Software engineering; Steady-state; Telecommunication computing
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
Print_ISBN :
1-4244-0054-6
Type :
conf
DOI :
10.1109/IPDPS.2006.1639672
Filename :
1639672