• DocumentCode
    3275397
  • Title

    Multi-failure fault-tolerance of embedded loops on hypercubes: issues and performance study

  • Author

    Liang, C.T. ; Tsai, W.T.

  • Author_Institution
    Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
  • fYear
    1990
  • fDate
    9-13 Dec 1990
  • Firstpage
    511
  • Lastpage
    518
  • Abstract
    The authors study the multi-failure fault-tolerance of hypercubes. Reconfiguration algorithms are proposed to reallocate the function of failed nodes to spare nodes so the communication structure of the interrupted parallel algorithms is preserved. Both clustered fault and concurrent fault are considered. Loops are selected as the embedded communication structures, where a wide variety of applications have been implemented. In early work, two classes of fault-tolerant embedded loops, Mapping II and III, have been designed and proved one-step reconfigurable for any single failure. The authors derive from shortest path algorithms a distributed reconfiguration algorithm for multiple failures on these embedded loops. Proof of reconfigurability for clustered fault is conducted for Mapping III. Performance of both mappings is evaluated by simulation with parameters such as the average number of tolerable failures, the average number of job migrations, and the utilization rate of nodes
  • Keywords
    fault tolerant computing; hypercube networks; parallel algorithms; performance evaluation; Mapping II; Mapping III; clustered fault; concurrent fault; distributed reconfiguration algorithm; embedded communication structures; embedded loops; hypercubes; multi-failure fault-tolerance; parallel algorithms; performance study; shortest path; Clustering algorithms; Computational modeling; Computer networks; Computer science; Fault tolerance; Hardware; Hypercubes; Large-scale systems; Network topology; Parallel algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing, 1990. Proceedings of the Second IEEE Symposium on
  • Conference_Location
    Dallas, TX
  • Print_ISBN
    0-8186-2087-0
  • Type

    conf

  • DOI
    10.1109/SPDP.1990.143594
  • Filename
    143594