Title :
CASTED: Core-Adaptive Software Transient Error Detection for Tightly Coupled Cores
Author :
Mitropoulou, Konstantina ; Porpodas, Vasileios ; Cintra, M.
Author_Institution :
Sch. of Inf., Univ. of Edinburgh, Edinburgh, UK
Abstract :
Aggressive silicon process scaling over the last years has made transistors faster and less power consuming. Meanwhile, transistors have become more susceptible to errors. The need to maintain high reliability has led to the development of various software-based error detection methodologies which target either single-core or multi-core processors. In this work, we present CASTED, a Core-Adaptive Soft- ware Transient Error Detection methodology that focuses on improving the impact of error detection overhead on single- chip scalable architectures that are composed of tightly coupled cores. The proposed compiler methodology adaptively distributes the error detection overhead to the available resources across multiple cores, fully exploiting the abundant ILP of these architectures. CASTED adapts to a wide range of architecture configurations (issue-width, inter-core delay). We evaluate our technique on a range of architecture configurations using the MediabenchII video and SPEC CINT2000 benchmark suites. Our approach successfully adapts to (and regularly outperforms by up to 21.2%) the best fixed state-of- the-art approach while maintaining the same fault coverage.
Keywords :
error detection; multiprocessing systems; program compilers; software architecture; software reliability; CASTED; ILP; MediabenchII video; SPEC CINT2000 benchmark suites; aggressive silicon process scaling; architecture configurations; compiler methodology; core-adaptive software transient error detection; error detection overhead; multicore processors; power consuming; reliability; single-chip scalable architectures; single-core processor; software-based error detection methodology; tightly coupled cores; Clustering algorithms; Computer architecture; Delays; Hardware; Program processors; Registers; Transient analysis; adaptivity; software error detection;
Conference_Titel :
Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4673-6066-1
DOI :
10.1109/IPDPS.2013.107