• DocumentCode
    3178042
  • Title

    Dependability analysis of a commercial high-speed network

  • Author

    Stott, D.T. ; Hsueh, M.-C. ; Ries, G.L. ; Iyer, R.K.

  • Author_Institution
    Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
  • fYear
    1997
  • fDate
    24-27 June 1997
  • Firstpage
    248
  • Lastpage
    257
  • Abstract
    The paper presents an injection-based approach to analyze dependability of high-speed networks using the Myrinet as an example testbed. Instead of injecting faults related to network protocols, the authors injected faults into the host interface component, which performs the actual send and receive operations. The fault model used was a temporary single bit flip in an instruction executing on the host interface´s custom processor, corresponding to a transient fault in the processor itself. Results show that more than 25% of the injected faults resulted in interface failures. Furthermore, they observed fault propagation from an interface to its host computer or to another interface to which it sent a message. These findings suggest that two important issues for high-speed networking in critical applications are protecting the host computer from errant or malicious interface components and implementing thorough message acceptance test mechanisms to prevent errant messages from propagating faults between interfaces.
  • Keywords
    computer network reliability; fault tolerant computing; local area networks; network interfaces; Myrinet; commercial high-speed network; dependability analysis; errant interface component protection; errant messages; fault injection; fault model; fault propagation; host interface component; host interface custom processor; injection-based approach; instruction; interface failures; malicious interface component protection; message acceptance test mechanisms; receive operations; send operations; temporary single bit flip; transient fault; Application software; Computer networks; Electronic mail; Ethernet networks; Hardware; High-speed networks; Protection; Protocols; Switches; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1997. FTCS-27. Digest of Papers., Twenty-Seventh Annual International Symposium on
  • Conference_Location
    Seattle, WA, USA
  • ISSN
    0731-3071
  • Print_ISBN
    0-8186-7831-3
  • Type

    conf

  • DOI
    10.1109/FTCS.1997.614097
  • Filename
    614097