DocumentCode
2923352
Title
Assessing time coalescence techniques for the analysis of supercomputer logs
Author
Martino, Catello Di ; Cinque, Marcello ; Cotroneo, Domenico
Author_Institution
Center for Reliable & High-Performance Comput., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear
2012
fDate
25-28 June 2012
Firstpage
1
Lastpage
12
Abstract
This paper presents a novel approach to assess time coalescence techniques. These techniques are widely used to reconstruct the failure process of a system and to estimate dependability measurements from its event logs. The approach is based on the use of automatically generated logs, accompanied by the exact knowledge of the ground truth on the failure process. The assessment is conducted by comparing the presumed failure process, reconstructed via coalescence, with the ground truth. We focus on supercomputer logs, due to increasing importance of automatic event log analysis for these systems. Experimental results show how the approach allows to compare different time coalescence techniques and to identify their weaknesses with respect to given system settings. In addition, results revealed an interesting correlation between errors caused by the coalescence and errors in the estimation of dependability measurements.
Keywords
parallel machines; storage management; system recovery; automatic event log analysis; automatically generated logs; data coalescence; dependability measurement estimation; errors; failure process reconstruction; ground truth; supercomputer dependability; supercomputer log analysis; time coalescence technique assessment; Computational modeling; Generators; Libraries; Logic gates; Software; Supercomputers; Writing; Event Log Analysis; data coalescence; dependability assessment; supercomputer dependability;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Systems and Networks (DSN), 2012 42nd Annual IEEE/IFIP International Conference on
Conference_Location
Boston, MA
ISSN
1530-0889
Print_ISBN
978-1-4673-1624-8
Electronic_ISBN
1530-0889
Type
conf
DOI
10.1109/DSN.2012.6263946
Filename
6263946
Link To Document