• DocumentCode
    3548704
  • Title

    Program fault tolerance based on memory access behavior

  • Author

    Bowen, N.S. ; Pradhan, D.K.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    1991
  • fDate
    25-27 June 1991
  • Firstpage
    426
  • Lastpage
    433
  • Abstract
    Fault observability based on the behavior of the memory references is studied. As opposed to traditional studies that view memory as one large entity that must completely work to be considered reliable, this study emphasizes the usage patterns of a particular program´s memory. Expressions for the successful execution of a program that takes into account the usage of the data are developed. Three variations that depend on whether the program´s storage is pre-allocated, dynamically allocated, or constrained in allocation are presented. A theory is proposed to explain the phenomenon that increased workloads lead to increased failure rates, which has been observed in several studies. The model is used to study several program traces, and is shown that increased workloads could cause an increase of the observed failure rates in the range of 27% to 53%.<>
  • Keywords
    fault tolerant computing; program testing; fault observability; memory access behavior; memory references; program fault tolerance; program traces; Database systems; Delay; Error correction codes; Failure analysis; Fault tolerance; Observability; Performance analysis; Reliability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1991. FTCS-21. Digest of Papers., Twenty-First International Symposium
  • Conference_Location
    Montreal, Quebec, Canada
  • Print_ISBN
    0-8186-2150-8
  • Type

    conf

  • DOI
    10.1109/FTCS.1991.146696
  • Filename
    146696