Title :
Problem Determination in Enterprise Middleware Systems using Change Point Correlation of Time Series Data
Author :
Agarwal, Manoj K. ; Gupta, Manish ; Mann, Vijay ; Sachindran, Narendran ; Anerousis, Nikos ; Mummert, Lily
Author_Institution :
IBM India Res. Lab, New Delhi
Abstract :
Clustered enterprise middleware systems employing dynamic workload scheduling are susceptible to a variety of application malfunctions that can manifest themselves in a counterintuitive fashion and cause debilitating damage. Until now, diagnosing problems in this domain has involved investigating log files and configuration settings and has required in-depth knowledge of the middleware architecture and application design. This paper presents a method for problem determination using change point detection techniques and problem signatures consisting of a combination of changes (or absence of changes) in different metrics. We implemented this approach on a clustered middleware system and applied it to the detection of the storm drain condition: a debilitating problem encountered in clustered systems with counterintuitive symptoms. Our experimental results show that the system detects 93% of storm drain faults with no false positives.
Keywords :
business data processing; middleware; scheduling; time series; change point correlation; change point detection techniques; clustered systems; counterintuitive symptoms; dynamic workload scheduling; enterprise middleware systems; time series data; Degradation; Delay; Dynamic scheduling; Fluctuations; Hardware; Manuals; Middleware; Monitoring; Performance loss; Storms; Change Point Detection; Health Monitoring; Problem Determination; Storm Drain
Conference_Title :
Network Operations and Management Symposium, 2006. NOMS 2006. 10th IEEE/IFIP
Conference_Location :
Vancouver, BC
Print_ISBN :
1-4244-0142-9
DOI :
10.1109/NOMS.2006.1687576