DocumentCode
185237
Title
Double precision stencil computations on Kepler GPUs
Author
Vizitiu, Anamaria ; Itu, Lucian ; Lazar, Laszlo ; Suciu, Constantin
Author_Institution
Corp. Technol., Imaging & Comput. Vision, Siemens SRL, Braşov, Romania
fYear
2014
fDate
17-19 Oct. 2014
Firstpage
123
Lastpage
127
Abstract
Graphics Processing Units (GPU) have been used extensively for accelerating parallelizable applications in general, and scientific computations in particular. Stencil based algorithms are used intensively in various research areas and represent good candidates for GPU based acceleration. Since scientific computations have high accuracy requirements, herein we focus on stencil based double precision computations. For a seven-point stencil we introduce two basic implementations, which use two-dimensional and three-dimensional thread organization respectively. Different optimization techniques lead then to a total of seven different implementations, which are evaluated for two NVIDIA Kepler GPUs. The best performance is obtained for the GTX680 card, for a kernel with two-dimensional thread organization and optimized shared memory and register usage.
Keywords
graphics processing units; optimisation; GPU based acceleration; GTX680 card; NVIDIA Kepler GPU; double precision stencil computation; graphics processing units; optimization technique; seven-point stencil; stencil based algorithm; three-dimensional thread organization; two-dimensional thread organization; Arrays; Graphics processing units; Instruction sets; Kernel; Organizations; Registers; GPU; Kepler; double precision; optimization; stencil;
fLanguage
English
Publisher
ieee
Conference_Titel
System Theory, Control and Computing (ICSTCC), 2014 18th International Conference
Conference_Location
Sinaia
Type
conf
DOI
10.1109/ICSTCC.2014.6982402
Filename
6982402
Link To Document