Title :
Adaptive Profiling for Root-Cause Analysis of Performance Anomalies in Web-Based Applications
Author :
Magalhães, João Paulo ; Silva, Luis Moura
Author_Institution :
CIICESI, ESTGF-Porto Polytech. Inst., Felgueiras, Portugal
Abstract :
The most important factor in the assessment of the availability of a system is the mean-time to repair (MTTR). The lower the MTTR the higher the availability. A significant portion of the MTTR is spent in the detection and localization of the cause of the failure. One possible method that may provide good results in the root-cause analysis of application failures is run-time profiling. The major drawback of run-time profiling is the performance impact. In this paper we describe two algorithms for selective and adaptive profiling of web-based applications. The algorithms make use of a dynamic profiling interval and are mainly triggered when some of the transactions start presenting some symptoms of performance anomaly. The algorithms were tested under different types of degradation scenarios and compared to static sampling strategies. We observed through experimentation that the pinpoint of performance anomalies, supported by the data collected using the adaptive profiling algorithms, stills timely as with full-profiling while the response time overhead is reduced in almost 60%. When compared to a non-profiled version the response time overhead is less than 1.5%. These results show the viability of using run-time profiling to support quickly detection and pinpointing of performance anomalies and enable timely recovery.
Keywords :
Internet; system recovery; MTTR; Web-based application; adaptive profiling algorithm; application failures; dynamic profiling interval; mean time to repair; performance anomaly pinpointing; response time overhead reduction; root-cause analysis; run-time profiling; selective profiling; system recovery; Algorithm design and analysis; Correlation; Degradation; Heuristic algorithms; Monitoring; Servers; Time factors; application profiling; dependability; monitoring; performance anomalies; root-cause analysis;
Conference_Titel :
Network Computing and Applications (NCA), 2011 10th IEEE International Symposium on
Conference_Location :
Cambridge, MA
Print_ISBN :
978-1-4577-1052-0
Electronic_ISBN :
978-0-7695-4489-2
DOI :
10.1109/NCA.2011.30