• DocumentCode
    2481906
  • Title

    Balancing performance and fault detection for GPGPU workloads

  • Author

    Backer, Jerry B. ; Karri, Ramesh

  • Author_Institution
    Polytech. Inst., New York Univ., Brooklyn, OH, USA
  • fYear
    2012
  • fDate
    Sept. 30 2012-Oct. 3 2012
  • Firstpage
    518
  • Lastpage
    519
  • Abstract
    GPUs are increasingly being used for processing highly parallel scientific and high performance workloads. Such applications require correctness and accuracy of the computation. GPUs lack adequate support for detecting hardware faults that may lead to computation errors. We present a tunable fault detection scheme that allows one to balance GPU performance and fault checking by configuring the amount of resources to allocate for detection and the frequency of checking for faults.
  • Keywords
    fault diagnosis; graphics processing units; GPGPU workloads; balancing performance; hardware fault detection; Fault detection; Graphics processing units; Hardware; Instruction sets; Kernel; Redundancy; Transient analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Design (ICCD), 2012 IEEE 30th International Conference on
  • Conference_Location
    Montreal, QC
  • ISSN
    1063-6404
  • Print_ISBN
    978-1-4673-3051-0
  • Type

    conf

  • DOI
    10.1109/ICCD.2012.6378702
  • Filename
    6378702