DocumentCode
548528
Title
Filtering error log as time series in complex service-based storage systems
Author
Rao, Xiang ; Yin, Gang ; Wang, Huaimin ; Shi, Dianxi ; Zhu, Yanxu
Author_Institution
Nat. Lab. for Parallel & Distrib. Process., Nat. Univ. of Defense Technol., Changsha, China
fYear
2011
fDate
21-23 June 2011
Firstpage
226
Lastpage
231
Abstract
Mining log pattern to analyze the faults in large scale distributed system is affected by the existence of redundant and ambiguous noisy error logs. While existing works try to compress logs in a coarse granularity from temporal and spatial view to remove the redundancy, they fail to reserve those ambiguous logs that might truly relate to a fault, which misleads the fault characterizing result. By modeling error logs as time series and examining the similarity between trash error log template and target error log, the ambiguous error logs are kept and the affected patterns can be effectively removed. Experiments in a practical complex service-based storage show that up to 92% of the affected patterns can be filtered.
Keywords
data mining; distributed databases; information filtering; large-scale systems; storage allocation; time series; affected patterns; ambiguous error logs; ambiguous logs; ambiguous noisy error logs; coarse granularity; complex service-based storage systems; fault characterizing result; filtering error log; large scale distributed system; mining log pattern; redundant error logs; target error log; time series; trash error log template; Approximation methods; Computer crashes; Libraries; Matched filters; Time series analysis; Transforms; Service-based storage system; log filtering; trash error logs;
fLanguage
English
Publisher
ieee
Conference_Titel
Networked Computing and Advanced Information Management (NCM), 2011 7th International Conference on
Conference_Location
Gyeongju
Print_ISBN
978-1-4577-0185-6
Electronic_ISBN
978-89-88678-37-4
Type
conf
Filename
5967550
Link To Document