DocumentCode :
3445265
Title :
A Probabilistic Characterization of Fault Rings in Adaptively-Routed Mesh Interconnection Networks
Author :
Safaei, F. ; Khonsari, A. ; Dadlani, A. ; Ould-Khaoua, M.
Author_Institution :
Sch. of Comput. Sci., IPM, Tehran
fYear :
2008
fDate :
7-9 May 2008
Firstpage :
233
Lastpage :
238
Abstract :
With increase in concern for reliability in the current and next generation of multiprocessors system-on-chip (MP-SoCs), multi-computers, cluster computers, and peer-to-peer communication networks, fault-tolerance has become an integral part of these systems. One of the fundamental issues regarding fault-tolerance is how to efficiently route a faulty network where each component is associated with some probability of failure. Adaptive fault-tolerant routing algorithms have been frequently suggested in the literature as means of improving communication performance and fault-tolerant demands in computer systems. Also, several results have been reported on usage of fault rings in providing detours to messages blocked by faults and in routing messages adaptively around the rectangular faulty regions. In order to analyze the performance of such routing schemes, one must investigate the characteristics of fault rings. In this paper, we derive mathematical expressions to compute the probability of message facing the fault rings in the well-known mesh interconnection network. We also conduct extensive simulation experiments using a variety of faults, the results of which are used to confirm the accuracy of the proposed models.
Keywords :
failure analysis; fault tolerance; multiprocessor interconnection networks; network routing; network topology; probability; adaptively-routed mesh interconnection network; cluster computer; failure probabilistic characterization; fault ring; fault-tolerant routing algorithm; multicomputer system; multiprocessor system-on-chip; peer-to-peer communication network; Computer network reliability; Computer networks; Fault tolerance; Fault tolerant systems; Multiprocessing systems; Multiprocessor interconnection networks; Next generation networking; Peer to peer computing; Routing; Telecommunication network reliability; Adaptive Routings; Fault Rings; Fault-tolerance; Interconnection Networks; Mesh;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Architectures, Algorithms, and Networks, 2008. I-SPAN 2008. International Symposium on
Conference_Location :
Sydney, NSW
ISSN :
1087-4089
Print_ISBN :
978-0-7695-3125-0
Type :
conf
DOI :
10.1109/I-SPAN.2008.17
Filename :
4520221
Link To Document :
بازگشت