DocumentCode :
1691732
Title :
CUDA and OpenCL implementations of 3D Fast Wavelet Transform
Author :
Bernabé, Gregorio ; Guerrero, Ginés D. ; Fernández, Juan
Author_Institution :
Comput. Eng. Dept., Univ. of Murcia, Murcia, Spain
fYear :
2012
Firstpage :
1
Lastpage :
4
Abstract :
We present in this paper several implementations of the 3D Fast Wavelet Transform (3D-FWT) on CUDA and OpenCL running on a new Fermi Tesla architecture. We evaluate these proposals and make a comparison with others optimal executed on multicores CPU and Nvidia Tesla C870. Speedups of the CUDA version on Fermi architecture are the best results, improving the execution times on CPU, ranging from 5.3× to 7.4× for different image sizes, and up to 81 times faster when communications are neglected. Meanwhile, OpenCL obtains solid gains which range from 2× factors on small frame sizes to 3× factors on larger ones.
Keywords :
parallel architectures; wavelet transforms; 3D fast wavelet transform; CUDA implementations; Fermi Tesla architecture; Nvidia Tesla C870; OpenCL implementations; multicores CPU; open computing language; Computer architecture; Graphics processing unit; Hardware; Instruction sets; Programming; Wavelet transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems (LASCAS), 2012 IEEE Third Latin American Symposium on
Conference_Location :
Playa del Carmen
Print_ISBN :
978-1-4673-1207-3
Type :
conf
DOI :
10.1109/LASCAS.2012.6180318
Filename :
6180318
Link To Document :
بازگشت