Title :
Computing the Number of Calls Dropped Due to Failures
Author :
Trivedi, Kishor S. ; Wang, Dazhi ; Hunt, Jason
Author_Institution :
Dept. of Electr. & Comput. Eng., Duke Univ., Durham, NC, USA
Abstract :
Defects per million (DPM), defined as the number of calls out of a million dropped due to failures, is an important service (un)reliability measure for telecommunication systems. Most previous research derives the DPM from a steady-state system availability model. In this paper, we develop a novel method for DPM computation that takes into consideration not only system availability but also the impact of the service application and the transient behavior of failure recovery. We illustrate this approach using a real system, the IBM SIP SLEE cluster. Our method takes into account software and hardware failures, different stages of recovery, different phases of call flow, retry attempts, and the interactions between call flow and failure/recovery behavior.
Keywords :
fault tolerance; signalling protocols; telecommunication computing; telecommunication network reliability; DPM computation; IBM SIP SLEE cluster; call flow; defect per million; dropped call; failure recovery; hardware failure; software failure; steady state system availability model; telecommunication service reliability measure; Availability; Blades; Computational modeling; Maintenance engineering; Manuals; Peer to peer computing; Servers; Imperfect coverage; Markov chain; Session Initiation Protocol; Software Fault Tolerance; Voice over IP; user-perceived reliability;
Conference_Title :
Software Reliability Engineering (ISSRE), 2010 IEEE 21st International Symposium on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-9056-1
Electronic_ISBN :
1071-9458
DOI :
10.1109/ISSRE.2010.18