• DocumentCode
    3469144
  • Title

    Using physical and simulated fault injection to evaluate error detection mechanisms

  • Author

    Constantinescu, Cristian

  • Author_Institution
    Intel Corp., Hillsboro, OR, USA
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    186
  • Lastpage
    192
  • Abstract
    Effective error detection is paramount for building highly dependable computing systems. A new methodology, based on physical and simulated fault injection, is developed for evaluating error detection mechanisms. Our approach consists of two steps. First, transient faults are physically injected at the IC pin level of a prototype server. Experiments are carried our in a three dimensional space of events, the location, time of occurrence and duration of the fault being randomly selected. Improved detection circuitry is devised for decreasing signal sensitivity to transients. Second, simulated fault injection is performed to asses the effectiveness of the new detection mechanisms, without using expensive silicon implementations. Physical fault injection experiments, carried out on the server, and simulated fault injection, performed on protocol checker, are presented. Detection effectiveness is measured by the error detection coverage, defined as the conditional probability that an error is detected given that an error occurs. Fault injection reveals that coverage probability is a function of fault duration. The protocol checker significantly improves error detection. Although, further research is required to increase detection coverage of the errors induced by short transient faults
  • Keywords
    computer testing; error detection; fault tolerant computing; network servers; IC pin level; dependable computing systems; detection mechanisms; error detection coverage; error detection mechanisms; protocol checker; short transient faults; simulated fault injection; transient faults; Buildings; Circuit faults; Circuit simulation; Computational modeling; Computer architecture; Costs; Degradation; Electrical fault detection; Fault detection; Prototypes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Computing, 1999. Proceedings. 1999 Pacific Rim International Symposium on
  • Print_ISBN
    0-7695-0371-3
  • Type

    conf

  • DOI
    10.1109/PRDC.1999.816228
  • Filename
    816228