• DocumentCode
    3043462
  • Title

    Simulation of software behavior under hardware faults

  • Author

    Goswami, Kumar K. ; Iyer, Ravishankar K.

  • Author_Institution
    Center for Reliable & High-Performance Comput., Illinois Univ., Urbana, IL, USA
  • fYear
    1993
  • fDate
    22-24 June 1993
  • Firstpage
    218
  • Lastpage
    227
  • Abstract
    A simulation-based software model that permits application-specific dependability analysis in the early design stages is introduced. The model represents an application program by decomposing it into a graph model consisting of a set of nodes, a set of edges that probabilistically determine the flow from node to node, and a mapping of the nodes to memory. The software model simulates the execution of the program while errors are injected into the program's memory space. The model provides application-dependent parameters such as detection and propagation times and permits evaluation of function- and system-level error detection and recovery schemes. A case study illustrates the interaction between an application program and two detection schemes. Specifically, Gaussian elimination programs running on a Tandem Integrity S2 system with memory scrubbing are studied. Results from the simulation-based software model are validated with data measured from an actual Tandem Integrity S2 system. Application-dependent coverage values obtained with the model are compared with those obtained via traditional schemes that assume uniform or ramp memory access patterns. For the authors' program, some coverage values obtained with the traditional approaches were found to be 100% larger than those obtained with the software model.
  • Keywords
    software fault tolerance; Gaussian elimination programs; Tandem Integrity S2 system; application program; application specific dependability analysis; application-dependent parameters; graph model; hardware faults; memory scrubbing; ramp memory access patterns; simulation-based software-model; software behaviour simulation; software model; Analytical models; Application software; Availability; Computational modeling; Delay; Hardware; NASA; Random access memory; Software reliability; Software systems;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on
  • Conference_Location
    Toulouse, France
  • ISSN
    0731-3071
  • Print_ISBN
    0-8186-3680-7
  • Type

    conf

  • DOI
    10.1109/FTCS.1993.627325
  • Filename
    627325
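
The graph model summarized in the abstract (nodes, probabilistic edges, and a mapping of nodes to memory, with errors injected into the memory space) can be illustrated with a small sketch. This is not the authors' simulator: the graph, the edge probabilities, the memory layout, and the names `EDGES`, `MEMORY`, `run_once`, and `estimate_access_fraction` are all hypothetical, and "detection" is simplified here to mean that the program touches the corrupted word after the injection.

```python
import random

# Hypothetical program graph: node -> list of (successor, probability).
EDGES = {
    "init": [("loop", 1.0)],
    "loop": [("pivot", 0.7), ("done", 0.3)],
    "pivot": [("loop", 1.0)],
    "done": [],
}

# Hypothetical mapping of each node to the memory words it touches.
MEMORY = {
    "init": range(0, 4),
    "loop": range(4, 8),
    "pivot": range(8, 16),
    "done": range(16, 18),
}

MEM_SIZE = 32  # words 18..31 are never touched by this toy program


def run_once(rng, fault_addr, fault_step, max_steps=1000):
    """Walk the graph once.

    Returns the number of steps between the injection and the first
    access to the faulty word, or None if the word is never accessed.
    """
    node, step = "init", 0
    while step < max_steps:
        # The error becomes visible once the program touches the word.
        if step >= fault_step and fault_addr in MEMORY[node]:
            return step - fault_step
        successors = EDGES[node]
        if not successors:
            return None  # program finished without touching the word
        targets, weights = zip(*successors)
        node = rng.choices(targets, weights=weights)[0]
        step += 1
    return None


def estimate_access_fraction(trials=2000, seed=0):
    """Inject one error per run at a random word and a random early step.

    Returns (fraction of errors the program accessed, mean latency in steps).
    """
    rng = random.Random(seed)
    latencies = []
    for _ in range(trials):
        addr = rng.randrange(MEM_SIZE)
        when = rng.randrange(5)
        latency = run_once(rng, addr, when)
        if latency is not None:
            latencies.append(latency)
    fraction = len(latencies) / trials
    mean = sum(latencies) / len(latencies) if latencies else None
    return fraction, mean


if __name__ == "__main__":
    fraction, mean_latency = estimate_access_fraction()
    print("fraction of injected errors accessed:", fraction)
    print("mean steps from injection to access:", mean_latency)
```

Because nearly half of the words in this toy layout are never touched, the accessed fraction depends strongly on the program's own access pattern, which is the kind of application dependence the abstract contrasts with assumed uniform or ramp access patterns.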