• DocumentCode
    3549441
  • Title

    A data-centric approach to checksum reuse for array-intensive applications

  • Author

    Chen, G. ; Kandemir, M. ; Karakoy, M.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2005
  • fDate
    28 June-1 July 2005
  • Firstpage
    316
  • Lastpage
    325
  • Abstract
    Soft errors are transient faults that occur in VLSI circuits due to external radiation and affect the logic states of sensitive components. While many systems implement hardware-based protection techniques like ECC and other approaches to ensure an acceptable level of robustness against these errors, such solutions are generally very rigid and costly. Recent research discussed checksum-based software solutions that can be used in the context of array-intensive computations. While a checksum-based scheme can be more flexible than a hardware-based approach to reliability, it can also bring significant runtime overheads. Focusing on array-intensive applications, this paper proposes a compiler-directed data-centric strategy that maximizes reuse of checksums. A unique characteristic of the proposed scheme is that it can work with a given checksum assignment, and automatically - under compiler guidance - restructures the entire application code to maximize checksum reuse. This scheme can reduce checksum recomputing even further by inter-procedural checksum reuse. Our experiments clearly show that the proposed approach reduces the number of checksum calculations required by the previous work.
  • Keywords
    VLSI; error detection; fault tolerant computing; optimising compilers; parallel processing; transients; ECC; VLSI circuits; application code; array-intensive computations; checksum-based software solutions; compiler-directed data-centric strategy; hardware-based protection techniques; interprocedural checksum reuse; soft errors; transient faults; Application software; Circuit faults; Error correction codes; Logic circuits; Protection; Robustness; Runtime; Very large scale integration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Systems and Networks, 2005. DSN 2005. Proceedings. International Conference on
  • Print_ISBN
    0-7695-2282-3
  • Type

    conf

  • DOI
    10.1109/DSN.2005.5
  • Filename
    1467806