Title :
Double precision stencil computations on Kepler GPUs
Author :
Vizitiu, Anamaria ; Itu, Lucian ; Lazar, Laszlo ; Suciu, Constantin
Author_Institution :
Corp. Technol., Imaging & Comput. Vision, Siemens SRL, Braşov, Romania
Abstract :
Graphics Processing Units (GPUs) have been used extensively to accelerate parallelizable applications in general, and scientific computations in particular. Stencil-based algorithms are used intensively in various research areas and are good candidates for GPU-based acceleration. Since scientific computations have high accuracy requirements, we focus herein on stencil-based double precision computations. For a seven-point stencil we introduce two basic implementations, which use two-dimensional and three-dimensional thread organization, respectively. Different optimization techniques then lead to a total of seven implementations, which are evaluated on two NVIDIA Kepler GPUs. The best performance is obtained on the GTX680 card, for a kernel with two-dimensional thread organization and optimized shared memory and register usage.
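For readers unfamiliar with the term, a seven-point stencil updates each interior grid cell from its own value and its six face neighbours. The following is a minimal CPU sketch of one such sweep, not the paper's GPU kernels, which this record does not reproduce. The function name `seven_point_stencil`, the flat array layout, and the coefficients `c0` and `c1` are assumptions made for illustration. The loops over `x` and `y` stand in for a two-dimensional thread organization in which each thread marches along `z`; Python floats are double precision on standard builds.

```python
def seven_point_stencil(src, nx, ny, nz, c0, c1):
    """One sweep of a seven-point stencil over the interior of an
    nx * ny * nz grid stored as a flat list (x varies fastest).

    c0 weights the centre cell and c1 weights each of the six face
    neighbours. Both coefficients are hypothetical.
    """
    def idx(x, y, z):
        return (z * ny + y) * nx + x

    dst = list(src)  # boundary cells keep their input values

    # Each (x, y) pair plays the role of one thread in a 2D thread grid;
    # the inner loop is the march along z that such a thread would do.
    for y in range(1, ny - 1):
        for x in range(1, nx - 1):
            for z in range(1, nz - 1):
                i = idx(x, y, z)
                dst[i] = c0 * src[i] + c1 * (
                    src[idx(x - 1, y, z)] + src[idx(x + 1, y, z)]
                    + src[idx(x, y - 1, z)] + src[idx(x, y + 1, z)]
                    + src[idx(x, y, z - 1)] + src[idx(x, y, z + 1)]
                )
    return dst


if __name__ == "__main__":
    n = 3
    grid = [1.0] * (n * n * n)
    out = seven_point_stencil(grid, n, n, n, 0.5, 0.25)
    print(out[13])  # centre cell of the 3x3x3 grid
```

In a three-dimensional thread organization the `z` loop would also be distributed across threads, so each thread would update exactly one cell.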
Keywords :
graphics processing units; optimisation; GPU based acceleration; GTX680 card; NVIDIA Kepler GPU; double precision stencil computation; optimization technique; seven-point stencil; stencil based algorithm; three-dimensional thread organization; two-dimensional thread organization; Arrays; Instruction sets; Kernel; Organizations; Registers; GPU; Kepler; double precision; optimization; stencil
Conference_Title :
System Theory, Control and Computing (ICSTCC), 2014 18th International Conference
Conference_Location :
Sinaia
DOI :
10.1109/ICSTCC.2014.6982402