DocumentCode
3447998
Title
Towards understanding the effects of intermittent hardware faults on programs
Author
Rashid, Layali ; Pattabiraman, Karthik ; Gopalakrishnan, Sathish
Author_Institution
Univ. of British Columbia, Vancouver, BC, Canada
fYear
2010
fDate
June 28 2010-July 1 2010
Firstpage
101
Lastpage
106
Abstract
Intermittent hardware faults are bursts of errors that last from a few CPU cycles to a few seconds. They are caused by process variations, circuit wear-out, and temperature, clock or voltage fluctuations. Recent studies show that intermittent fault rates are increasing due to technology scaling and are likely to be a significant concern in future systems. We study the propagation of intermittent faults to programs; in particular, we are interested in the crash behaviour of programs. We use a model of a program that represents the data dependencies in a fault-free trace of the program and we analyze this model to glean some information about the length of intermittent faults and their effect on the program under specific fault and crash models. The results of our study can aid fault detection, diagnosis and recovery techniques.
Keywords
fault diagnosis; fault tolerant computing; program diagnostics; CPU cycle; circuit wear out; data dependency; fault detection; fault diagnosis; fault free trace; intermittent hardware fault; process variation; recovery technique; voltage fluctuation; Circuit faults; Clocks; Computer crashes; Computer errors; Fault detection; Fault diagnosis; Hardware; Information analysis; Temperature; Voltage fluctuations;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Systems and Networks Workshops (DSN-W), 2010 International Conference on
Conference_Location
Chicago, IL
Print_ISBN
978-1-4244-7729-6
Electronic_ISBN
978-1-4244-7728-9
Type
conf
DOI
10.1109/DSNW.2010.5542613
Filename
5542613
Link To Document