Title :
Balancing performance and fault detection for GPGPU workloads
Author :
Backer, Jerry B. ; Karri, Ramesh
Author_Institution :
Polytech. Inst., New York Univ., Brooklyn, OH, USA
fDate :
Sept. 30 2012-Oct. 3 2012
Abstract :
GPUs are increasingly being used for processing highly parallel scientific and high performance workloads. Such applications require correctness and accuracy of the computation. GPUs lack adequate support for detecting hardware faults that may lead to computation errors. We present a tunable fault detection scheme that allows one to balance GPU performance and fault checking by configuring the amount of resources to allocate for detection and the frequency of checking for faults.
Keywords :
fault diagnosis; graphics processing units; GPGPU workloads; balancing performance; hardware fault detection; Fault detection; Graphics processing units; Hardware; Instruction sets; Kernel; Redundancy; Transient analysis;
Conference_Titel :
Computer Design (ICCD), 2012 IEEE 30th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-3051-0
DOI :
10.1109/ICCD.2012.6378702