• DocumentCode
    185237
  • Title

    Double precision stencil computations on Kepler GPUs

  • Author

    Vizitiu, Anamaria ; Itu, Lucian ; Lazar, Laszlo ; Suciu, Constantin

  • Author_Institution
    Corp. Technol., Imaging & Comput. Vision, Siemens SRL, Braşov, Romania
  • fYear
    2014
  • fDate
    17-19 Oct. 2014
  • Firstpage
    123
  • Lastpage
    127
  • Abstract
    Graphics Processing Units (GPU) have been used extensively for accelerating parallelizable applications in general, and scientific computations in particular. Stencil based algorithms are used intensively in various research areas and represent good candidates for GPU based acceleration. Since scientific computations have high accuracy requirements, herein we focus on stencil based double precision computations. For a seven-point stencil we introduce two basic implementations, which use two-dimensional and three-dimensional thread organization respectively. Different optimization techniques lead then to a total of seven different implementations, which are evaluated for two NVIDIA Kepler GPUs. The best performance is obtained for the GTX680 card, for a kernel with two-dimensional thread organization and optimized shared memory and register usage.
  • Keywords
    graphics processing units; optimisation; GPU based acceleration; GTX680 card; NVIDIA Kepler GPU; double precision stencil computation; graphics processing units; optimization technique; seven-point stencil; stencil based algorithm; three-dimensional thread organization; two-dimensional thread organization; Arrays; Graphics processing units; Instruction sets; Kernel; Organizations; Registers; GPU; Kepler; double precision; optimization; stencil;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Theory, Control and Computing (ICSTCC), 2014 18th International Conference
  • Conference_Location
    Sinaia
  • Type

    conf

  • DOI
    10.1109/ICSTCC.2014.6982402
  • Filename
    6982402