Title :
CUDA and OpenCL implementations of 3D Fast Wavelet Transform
Author :
Bernabé, Gregorio ; Guerrero, Ginés D. ; Fernández, Juan
Author_Institution :
Comput. Eng. Dept., Univ. of Murcia, Murcia, Spain
Abstract :
We present several implementations of the 3D Fast Wavelet Transform (3D-FWT) in CUDA and OpenCL running on the new Fermi Tesla architecture. We evaluate these proposals and compare them with other optimal implementations executed on multicore CPUs and on an Nvidia Tesla C870. The CUDA version on the Fermi architecture achieves the best results, with speedups over the CPU execution times ranging from 5.3× to 7.4× for different image sizes, and up to 81× when communications are neglected. Meanwhile, OpenCL obtains solid gains, ranging from a factor of 2× for small frame sizes to 3× for larger ones.
Keywords :
parallel architectures; wavelet transforms; 3D fast wavelet transform; CUDA implementations; Fermi Tesla architecture; Nvidia Tesla C870; OpenCL implementations; multicore CPU; open computing language; Computer architecture; Graphics processing unit; Hardware; Instruction sets; Programming; Wavelet transforms
Conference_Title :
Circuits and Systems (LASCAS), 2012 IEEE Third Latin American Symposium on
Conference_Location :
Playa del Carmen
Print_ISBN :
978-1-4673-1207-3
DOI :
10.1109/LASCAS.2012.6180318