• DocumentCode
    734990
  • Title

    Failure mitigation in linear, sesquilinear and bijective operations on integer data streams via numerical entanglement

  • Author

    Anam, Mohammad Ashraful ; Andreopoulos, Yiannis

  • Author_Institution
    Electron. & Electr. Eng. Dept., Univ. Coll. London, London, UK
  • fYear
    2015
  • fDate
    6-8 July 2015
  • Firstpage
    122
  • Lastpage
    127
  • Abstract
    A new roll-forward technique is proposed that recovers from any single fail-stop failure in M integer data streams (M ≥ 3) when undergoing linear, sesquilinear or bijective (LSB) operations, such as: scaling, additions/subtractions, inner or outer vector products and permutations. In the proposed approach, the M input integer data streams are linearly superimposed to form M numerically entangled integer data streams that are stored inplace of the original inputs. A series of LSB operations can then be performed directly using these entangled data streams. The output results can be extracted from any M-1 entangled output streams by additions and arithmetic shifts, thereby guaranteeing robustness to a fail-stop failure in any single stream computation. Importantly, unlike other methods, the number of operations required for the entanglement, extraction and recovery of the results is linearly related to the number of the inputs and does not depend on the complexity of the performed LSB operations. We have validated our proposal in an Intel processor (Haswell architecture with AVX2 support) via convolution operations. Our analysis and experiments reveal that the proposed approach incurs only 1.8% to 2.8% reduction in processing throughput in comparison to the failure-intolerant approach. This overhead is 9 to 14 times smaller than that of the equivalent checksum-based method. Thus, our proposal can be used in distributed systems and unreliable processor hardware, or safety-critical applications, where robustness against fail-stop failures becomes a necessity.
  • Keywords
    distributed processing; fault tolerant computing; microprocessor chips; numerical analysis; Intel processor; LSB operation; bijective operation; convolution operation; fail-stop failure; failure mitigation; integer data stream; least sequential bit operation; linear operation; numerical entanglement; processing throughput; roll-forward technique; sesquilinear operation; Complexity theory; Convolution; Dynamic range; Indexes; Kernel; Proposals; Reliability; algorithm-based fault tolerance; fail-stop failure; linear operations; numerical entanglement; sum-of-products;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    On-Line Testing Symposium (IOLTS), 2015 IEEE 21st International
  • Conference_Location
    Halkidiki
  • Type

    conf

  • DOI
    10.1109/IOLTS.2015.7229844
  • Filename
    7229844