Title :
Transient Fault Prediction Based on Anomalies in Processor Events
Author :
Narayanasamy, Satish ; Coskun, Ayse K. ; Calder, Brad
Author_Institution :
California Univ., San Diego, CA
Abstract :
Future microprocessors will be highly susceptible to transient errors as the sizes of transistors decrease due to CMOS scaling. Prior techniques advocated full scale structural or temporal redundancy to achieve fault tolerance. Though they can provide complete fault coverage, they incur significant hardware and/or performance cost. It is desirable to have mechanisms that can provide partial but sufficiently high fault coverage with negligible cost. To meet this goal, we propose leveraging speculative structures that already exist in modern processors. The proposed mechanism is based on the insight that when a fault occurs, it is likely that the incorrect execution would result in abnormally higher or lower number of mispredictions (branch mispredictions, L2 misses, store set mispredictions) than a correct execution. We design a simple transient fault predictor that detects the anomalous behavior in the outcomes of the speculative structures to predict transient faults
Keywords :
CMOS integrated circuits; fault tolerance; microprocessor chips; redundancy; transistors; CMOS scaling; L2 misses; branch mispredictions; fault tolerance; leveraging speculative structures; microprocessors; store set mispredictions; temporal redundancy; transient errors; transient fault prediction; transistors; CMOS process; CMOS technology; Costs; Fault detection; Fault tolerance; Hardware; Microprocessors; Noise level; Noise reduction; Redundancy;
Conference_Titel :
Design, Automation & Test in Europe Conference & Exhibition, 2007. DATE '07
Conference_Location :
Nice
Print_ISBN :
978-3-9810801-2-4
DOI :
10.1109/DATE.2007.364448