DocumentCode
166705
Title
Anisotropic nonlinear diffusion for filtering 3D images on GPUs
Author
Tabik, Siham ; Murarasu, Alin ; Romero, Luis F.
Author_Institution
Dept. of Comput. Archit., Univ. of Malaga, Malaga, Spain
fYear
2014
fDate
22-26 Sept. 2014
Firstpage
339
Lastpage
345
Abstract
Optimizing sophisticated PDE-based filtering methods, such as the Anisotropic Nonlinear Diffusion (AND), to GPUs is complicated and time consuming. In this work, we expressed AND as iterative multiple 3D-stencils, where each 3D-stencil is implemented into one kernel, and then we analyzed all possible kernel fusions on the GPU. We experimentally found that fusing dependent stencils with similar concurrency and lower on-chip pressure makes the optimal combination run 1, 52× faster than the next better one.
Keywords
filtering theory; graphics processing units; image processing; 3D image filtering; AND; GPU; PDE based filtering methods; anisotropic nonlinear diffusion; onchip pressure; Graphics processing units; Instruction sets; Kernel; Multicore processing; Optimization; Smoothing methods; Tensile stress;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2014 IEEE International Conference on
Conference_Location
Madrid
Type
conf
DOI
10.1109/CLUSTER.2014.6968786
Filename
6968786
Link To Document