Title :
Multiresolution Abnormal Trace Detection Using Varied-Length n-Grams and Automata
Author :
Jiang, Guofei ; Chen, Haifeng ; Ungureanu, Cristian ; Yoshihira, Kenji
Author_Institution :
NEC Labs. America, Princeton, NJ
Abstract :
Detection and diagnosis of faults in a large-scale distributed system is a formidable task. Interest in monitoring and using traces of user requests for fault detection has been on the rise recently. In this paper we propose novel fault detection methods based on abnormal trace detection. One essential problem is how to represent the large amount of training trace data compactly as an oracle. Our key contribution is the novel use of varied-length n-grams and automata to characterize normal traces. A new trace is compared against the learned automata to determine whether it is abnormal. We develop algorithms to automatically extract n-grams and construct multiresolution automata from training data. Further, both deterministic and multihypothesis algorithms are proposed for detection. We inspect the trace constraints of real application software and verify the existence of long n-grams. Our approach is tested in a real system with injected faults and achieves good results in experiments
Keywords :
Internet; automata theory; fault diagnosis; fault tolerant computing; learning (artificial intelligence); Oracle; deterministic algorithms; fault detection; fault diagnosis; large-scale distributed system; learned automata; multihypothesis algorithms; multiresolution abnormal trace detection; real system; trace data training; using varied-length n-grams; Application software; Debugging; Fault detection; Fault diagnosis; Information systems; Large-scale systems; Learning automata; Middleware; Monitoring; Web and internet services; n-gram; Abnormal trace; algorithm; automata; fault detection; large-scale information system;
Journal_Title :
Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
DOI :
10.1109/TSMCC.2006.871569