DocumentCode
584533
Title
An Offline Analysis Framework of Cluster System Logs
Author
Yang, Bowen ; Xu, Jungang
Author_Institution
Sch. of Inf. Sci. & Eng., Grad. Univ. of Chinese Acad. of Sci., Beijing, China
fYear
2012
fDate
11-13 Aug. 2012
Firstpage
1724
Lastpage
1728
Abstract
The growth of computing and storage needs of several scientific applications mandate the deployment of extreme-scale parallel machines, such as Blue Gene/L, Spirit, Liberty, Red Storm and etc. One of the challenges when designing and deploying these systems in a production setting is the need to take failure occurrences into account. In this paper, an offline analysis framework of cluster system logs is designed and implemented which includes four main parts, such as log formatting module, log filtering module, a central database and log mining module. Finally, four cluster system logs are analyzed in the temporal and spatial statistical characteristics of error events.
Keywords
data mining; system monitoring; temporal databases; visual databases; Blue Gene-L; Liberty; Red Storm; Spirit; central database; cluster system logs; error events; extreme-scale parallel machines; failure occurrences; log filtering module; log formatting module; log mining module; offline analysis framework; scientific applications; spatial statistical characteristics; temporal statistical characteristics; Computers; Conferences; Databases; Educational institutions; Failure analysis; Filtering; Reliability; cluster system; log analysis; regular expression;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science & Service System (CSSS), 2012 International Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4673-0721-5
Type
conf
DOI
10.1109/CSSS.2012.431
Filename
6394750
Link To Document