Title of article
Event storm detection and identification in communication systems
Author/Authors
Mouayad Albaghdadi، نويسنده , , Bruce Briley، نويسنده , , Martha Evens، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2006
Pages
12
From page
602
To page
613
Abstract
Event storms are the manifestation of an important class of abnormal behaviors in communication systems. They occur when a large number of nodes throughout the system generate a set of events within a small period of time. It is essential for network management systems to detect every event storm and identify its cause, in order to prevent and repair potential system faults.
This paper presents a set of techniques for the effective detection and identification of event storms in communication systems. First, we introduce a new algorithm to synchronize events to a single node in the system. Second, the systemʹs event log is modeled as a normally distributed random process. This is achieved by using data analysis techniques to explore and then model the statistical behavior of the event log. Third, event storm detection is proposed using a simple test statistic combined with an exponential smoothing technique to overcome the non-stationary behavior of event logs. Fourth, the system is divided into non-overlapping regions to locate the main contributing regions of a storm. We show that this technique provides us with a method for event storm identification. Finally, experimental results from a commercially deployed multimedia communication system that uses these techniques demonstrate their effectiveness.
Keywords
Event storms , Exploratory data analysis , Fault detection , Fault identification , Fault management , Network management , Wireless networks
Journal title
Reliability Engineering and System Safety
Serial Year
2006
Journal title
Reliability Engineering and System Safety
Record number
1187461
Link To Document