DocumentCode :
3370610
Title :
Application-Aware diagnosis of runtime hardware faults
Author :
Pellegrini, Andrea ; Bertacco, Valeria
Author_Institution :
Univ. of Michigan, Ann Arbor, MI, USA
fYear :
2010
fDate :
7-11 Nov. 2010
Firstpage :
487
Lastpage :
492
Abstract :
Extreme technology scaling in silicon devices drastically affects reliability, particularly because of runtime failures induced by transistor wearout. Current online testing mechanisms focus on testing all components in a microprocessor, including hardware that has not been exercised, and thus have high performance penalties. We propose a hybrid hardware/software online testing solution where components that are heavily utilized by the software application are tested more thoroughly and frequently. Thus, our online testing approach focuses on the processor units that affect application correctness the most, and it achieves high coverage while incurring minimal performance overhead. We also introduce a new metric, Application-Aware Fault Coverage, measuring a test´s capability to detect faults that might have corrupted the state or the output of an application. Test coverage is further improved through the insertion of observation points that augment the coverage of the testing system. By evaluating our technique on a Sun OpenSPARC T1, we show that our solution maintains high Application-Aware Fault Coverage while reducing the performance overhead of online testing by more than a factor of 2 when compared to solutions oblivious to application´s behavior. Specifically, we found that our solution can achieve 95% fault coverage while maintaining a minimal performance overhead (1.3%) and area impact (0.4%).
Keywords :
microprocessor chips; semiconductor device reliability; semiconductor device testing; silicon; application aware diagnosis; current online testing mechanisms; microprocessor; reliability; runtime hardware faults; silicon devices; transistor wearout; Hardware; Monitoring; Pipelines; Radiation detectors; Software; Testing; Transistors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer-Aided Design (ICCAD), 2010 IEEE/ACM International Conference on
Conference_Location :
San Jose, CA
ISSN :
1092-3152
Print_ISBN :
978-1-4244-8193-4
Type :
conf
DOI :
10.1109/ICCAD.2010.5653788
Filename :
5653788
Link To Document :
بازگشت