• DocumentCode
    2257464
  • Title

    Optimal fault-tolerant routing in hypercubes using extended safety vectors

  • Author

    Wu, Jie ; Gao, Feng ; Li, Zhongcheng ; Min, Yinghua

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Florida Atlantic Univ., Boca Raton, FL, USA
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    264
  • Lastpage
    271
  • Abstract
    Reliable communication in cube-based multicomputers using the extended safety vector concept is studied. Each node in a cube-based multicomputer of dimension n is assorted with an extended safety vector of n bits, which is an approximated measure of the number and distribution of faults in the neighborhood. In the extended safety vector model, each node knows fault information within distance-2 and fault information outside distance-2 is coded in a special way based on the coded information of its neighbors. The extended safety vector of each node can be easily calculated through n-1 rounds of information exchanges among neighboring nodes. Optimal unicasting between two nodes is guaranteed if the kth bit of the safety vector of the source node is one, where k is the Hamming distance between the source and destination nodes. In addition, the extended safety vector can be used as a navigation tool to direct a message to its destination through a minimal path. Simulation results show a significant improvement in terms of optimal routing capability in a hypercube with faulty links using the proposed model, compared with the one using the original safety vector model
  • Keywords
    fault tolerant computing; hypercube networks; multiprocessing systems; safety; Hamming distance; approximated measure; coded information; cube-based multicomputers; extended safety vectors; fault information; faulty links; hypercube; hypercubes; information exchanges; minimal path; navigation tool; neighboring nodes; optimal fault-tolerant routing; optimal routing capability; optimal unicasting; original safety vector model; reliable communication; safety vector; Computer science; Fault tolerance; Hamming distance; Hypercubes; Navigation; Prototypes; Reliability engineering; Routing; Safety; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems, 2000. Proceedings. Seventh International Conference on
  • Conference_Location
    Iwate
  • ISSN
    1521-9097
  • Print_ISBN
    0-7695-0568-6
  • Type

    conf

  • DOI
    10.1109/ICPADS.2000.857707
  • Filename
    857707