DocumentCode
909287
Title
Reliability analysis in distributed systems
Author
Raghavendra, C.S. ; Kumar, V. K. Prasanna ; Hariri, Salim
Author_Institution
Dept. of Electr. Eng.-Syst., Univ. of Southern California, Los Angeles, CA, USA
Volume
37
Issue
3
fYear
1988
fDate
3/1/1988 12:00:00 AM
Firstpage
352
Lastpage
358
Abstract
Reliability of a distributed processing system is an important design parameter that can be described in terms of the reliability of processing elements and communication links and also of the redundancy of programs and data files. The traditional terminal-pair reliability does not capture the redundancy of programs and files in a distributed system. Two reliability measures are introduced: distributed program reliability, which describes the probability of successful execution of a program requiring cooperation of several computers, and distributed system reliability, which is the probability that all the specified distributed programs for the system are operational. These two reliability measures can be extended to incorporate the effects of user sites on reliability. An efficient approach based on graph traversal is developed to evaluate the proposed reliability measures.
Keywords
distributed processing; fault tolerant computing; communication links; data files; design parameter; distributed program reliability; distributed systems; graph traversal; redundancy; Computer network reliability; Distributed computing; Distributed processing; Intelligent networks; Load management; Redundancy; Resource management; Surface-mount technology; Telecommunication network reliability; Tree graphs
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
IEEE
ISSN
0018-9340
Type
jour
DOI
10.1109/12.2173
Filename
2173