DocumentCode
3354983
Title
Routing in modular fault tolerant multiprocessor systems
Author
Alam, M.S. ; Melhem, R.G.
Author_Institution
Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
fYear
1992
fDate
8-10 July 1992
Firstpage
185
Lastpage
193
Abstract
The authors consider a class of modular multiprocessor architectures in which spares are added to each module to cover for faulty nodes within that module, thus forming a fault tolerant basic block (FTBB). The goal is to preserve the logical adjacency between active nodes by means of a routing algorithm which delivers messages successfully to their destinations. Two phase routing strategies are introduced that route messages first to their destination FTBB, and then to the destination nodes within the destination FTBB. This strategy may be applied to a variety of architectures including binary hypercubes and 3-D tori. In the presence of f faults in these systems. It is shown that the worst case length of the message route is max( sigma +f, (K+1) sigma )+M, where sigma is the shortest path in the absence of faults, and M and K are the numbers of primary nodes and spare nodes in a FTBB, respectively. The average routing overhead is much lower than the worst case overhead.<>
Keywords
fault tolerant computing; hypercube networks; multiprocessing systems; 3-D tori; binary hypercubes; fault tolerant basic block; faulty nodes; logical adjacency; modular fault tolerant multiprocessor systems; routing; routing algorithm; worst case length; Computer architecture; Computer science; Degradation; Fault tolerance; Fault tolerant systems; Hypercubes; Multiprocessing systems; Real time systems; Routing; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Fault-Tolerant Computing, 1992. FTCS-22. Digest of Papers., Twenty-Second International Symposium on
Conference_Location
Boston, MA, USA
Print_ISBN
0-8186-2875-8
Type
conf
DOI
10.1109/FTCS.1992.243601
Filename
243601
Link To Document