Title :
Anisotropic nonlinear diffusion for filtering 3D images on GPUs
Author :
Tabik, Siham ; Murarasu, Alin ; Romero, Luis F.
Author_Institution :
Dept. of Comput. Archit., Univ. of Malaga, Malaga, Spain
Abstract :
Optimizing sophisticated PDE-based filtering methods, such as Anisotropic Nonlinear Diffusion (AND), for GPUs is complicated and time-consuming. In this work, we expressed AND as multiple iterative 3D stencils, where each 3D stencil is implemented as one kernel, and then analyzed all possible kernel fusions on the GPU. We found experimentally that fusing dependent stencils with similar concurrency and lower on-chip pressure yields the optimal combination, which runs 1.52× faster than the next-best one.
Keywords :
filtering theory; graphics processing units; image processing; 3D image filtering; AND; GPU; PDE-based filtering methods; anisotropic nonlinear diffusion; on-chip pressure; Graphics processing units; Instruction sets; Kernel; Multicore processing; Optimization; Smoothing methods; Tensile stress
Conference_Title :
Cluster Computing (CLUSTER), 2014 IEEE International Conference on
Conference_Location :
Madrid
DOI :
10.1109/CLUSTER.2014.6968786