• DocumentCode
    1161054
  • Title

    Optimal, and reliable communication in hypercubes using extended safety vectors

  • Author

    Wu, Jie ; Gao, Feng ; Li, Zhongcheng ; Min, Yinghua

  • Author_Institution
    Dept. of Comput. Sci., Florida Atlantic Univ., Boca Raton, FL, USA
  • Volume
    54
  • Issue
    3
  • fYear
    2005
  • Firstpage
    402
  • Lastpage
    411
  • Abstract
    We propose a new coding method of limited global fault information in an n-cube. First, each node collects precise fault information within distance-d, and then fault information about nodes that are more than distance-d away is coded in a special way. Specifically, in our approach, each node in a cube-based multicomputer of dimension n is associated with an extended safety vector of n bits. In the extended safety vector model, each node knows fault information within distance-2; fault information outside distance-2 is coded in a special way based on the coded information of its neighbors. The extended safety vector of each node can be easily calculated through n-1 rounds of information exchanges among neighboring nodes. Therefore, each extended safety vector is an approximated measure of the number & distribution of faults in the neighborhood. Optimal unicasting between two nodes is guaranteed if the kth bit of the safety vector of the source node is one, where k is the Hamming distance between the source & destination nodes. In addition, the extended safety vector can be used as a navigation tool to direct a message to its destination through a minimal path. A simulation study has been conducted based on different selections of d, and results have shown a significant improvement under the proposed model over the safety vector model in handling link faults, even for a small value of d as in the extended safety vector model where d=2.
  • Keywords
    encoding; fault tolerant computing; hypercube networks; safety systems; Hamming distance; coding method; cube-based multicomputer; extended safety vector; fault-tolerant routing; hypercube; link fault; reliable communication; Computer architecture; Computer science; Hamming distance; Hypercubes; Laboratories; Navigation; Network topology; Routing; Safety; Very large scale integration; Fault-tolerant routing; generalized hypercubes; multicomputers; reliable communication; safety vectors;
  • fLanguage
    English
  • Journal_Title
    Reliability, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9529
  • Type

    jour

  • DOI
    10.1109/TR.2005.853439
  • Filename
    1505045