• DocumentCode
    3118476
  • Title

    Increasing the robustness of CUDA Fermi GPU-based systems

  • Author

    Di Carlo, S. ; Gambardella, Giulio ; Indaco, M. ; Martella, Ippazio ; Prinetto, P. ; Rolfo, Daniele ; Trotta, Pascal

  • Author_Institution
    Dipt. di Autom. e Inf., Politec. di Torino, Turin, Italy
  • fYear
    2013
  • fDate
    8-10 July 2013
  • Firstpage
    234
  • Lastpage
    235
  • Abstract
    Graphics Processing Units (GPUs) have become increasingly popular due to their high computational power and low price. This makes them particularly suitable for high-performance computing applications such as data and image processing. In these fields, the ability to work properly even in the presence of faults is mandatory. This paper presents an innovative approach that combines a Software-Based Self-Test & Diagnosis (SBSTD) methodology with a fault mitigation strategy to increase the robustness of a CUDA Fermi GPU-based system.
  • Keywords
    built-in self test; fault diagnosis; fault tolerant computing; graphics processing units; parallel architectures; parallel programming; CUDA Fermi GPU-based systems; SBSTD methodology; fault mitigation strategy; graphical processing units; high-performance computing applications; software based self test and diagnosis methodology; Computer architecture; Fault tolerance; Fault tolerant systems; Graphics processing units; Kernel; Robustness;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    On-Line Testing Symposium (IOLTS), 2013 IEEE 19th International
  • Conference_Location
    Chania
  • Type

    conf

  • DOI
    10.1109/IOLTS.2013.6604088
  • Filename
    6604088