DocumentCode
8148
Title
Low-Cost Concurrent Error Detection for Floating-Point Unit (FPU) Controllers
Author
Maniatakos, Michail ; Kudva, Prabhakar ; Fleischer, B.M. ; Makris, Yiorgos
Author_Institution
Dept. of Electr. Eng., New York Univ. Abu Dhabi, Abu Dhabi, United Arab Emirates
Volume
62
Issue
7
fYear
2013
fDate
Jul-13
Firstpage
1376
Lastpage
1388
Abstract
We present a nonintrusive concurrent error detection (CED) method for protecting the control logic of a contemporary floating-point unit (FPU). The proposed method is based on the observation that control logic errors lead to extensive data path corruption and affect, with high probability, the exponent part of the IEEE-754 floating-point representation. Thus, exponent monitoring can be utilized to detect errors in the control logic of the FPU. Predicting the exponent involves relatively simple operations; therefore, our method incurs significantly lower overhead than the classical approach of duplicating the control logic of the FPU. Indeed, experimental results on the openSPARC T1 processor using SPEC2006FP benchmarks show that as compared to control logic duplication, which incurs an area overhead of 17.9 percent of the FPU size, our method incurs an area overhead of only 5.8 percent yet still achieves detection of over 93 percent of transient errors in the FPU control logic. Moreover, the proposed method offers the ancillary benefit of also detecting 98.1 percent of the data path errors that affect the exponent, which cannot be detected via duplication of control logic. Finally, when combined with a classical residue code-based method for the fraction, our method leads to a complete CED solution for the entire FPU which provides a coverage of 94.1 percent of all errors at an area cost of 16.32 percent of the FPU size.
Keywords
floating point arithmetic; multiprocessing systems; CED method; FPU controller; IEEE-754 floating-point representation; SPEC2006FP benchmark; contemporary floating-point unit; control logic protection; data path corruption; exponent monitoring; nonintrusive concurrent error detection method; openSPARC T1 processor; residue code-based method; Adders; Hardware; Microprocessors; Monitoring; Pipelines; Software; Transient analysis; Adders; CED method; Error correction; FPU controller; Hardware; IEEE-754; IEEE-754 floating-point representation; Microprocessors; Monitoring; Pipelines; SPEC2006FP benchmark; Software; Transient analysis; contemporary floating-point unit; control logic; control logic protection; data path corruption; exponent monitoring; floating point; floating point arithmetic; multiprocessing systems; nonintrusive concurrent error detection method; openSPARC T1 processor; residue code-based method;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2012.81
Filename
6178236
Link To Document