DocumentCode :
1285869
Title :
Accelerating Wavelet Lifting on Graphics Hardware Using CUDA
Author :
Van Der Laan, Wladimir J. ; Jalba, Andrei C. ; Roerdink, Jos B T M
Author_Institution :
Johann Bernoulli Inst. for Math. & Comput. Sci., Univ. of Groningen, Groningen, Netherlands
Volume :
22
Issue :
1
fYear :
2011
Firstpage :
132
Lastpage :
146
Abstract :
The Discrete Wavelet Transform (DWT) has a wide range of applications from signal processing to video and image compression. We show that this transform, by means of the lifting scheme, can be performed in a memory and computation-efficient way on modern, programmable GPUs, which can be regarded as massively parallel coprocessors through NVidia´s CUDA compute paradigm. The three main hardware architectures for the 2D DWT (row-column, line-based, block-based) are shown to be unsuitable for a CUDA implementation. Our CUDA-specific design can be regarded as a hybrid method between the row-column and block-based methods. We achieve considerable speedups compared to an optimized CPU implementation and earlier non-CUDA-based GPU DWT methods, both for 2D images and 3D volume data. Additionally, memory usage can be reduced significantly compared to previous GPU DWT methods. The method is scalable and the fastest GPU implementation among the methods considered. A performance analysis shows that the results of our CUDA-specific design are in close agreement with our theoretical complexity analysis.
Keywords :
computer graphic equipment; coprocessors; data compression; discrete wavelet transforms; parallel architectures; solid modelling; video coding; 2D DWT; 2D images; 3D volume; CUDA specific design; NVidia CUDA compute paradigm; accelerating wavelet lifting; block based method; discrete wavelet transform; fastest GPU implementation; graphics hardware; hardware architectures; image compression; memory usage; nonCUDA-based GPU DWT methods; optimized CPU implementation; parallel coprocessors; programmable GPU; signal processing; video compression; Acceleration; Approximation methods; Concurrent computing; Discrete transforms; Discrete wavelet transforms; Fractals; Graphics; Graphics processing unit; Hardware; Image coding; Performance analysis; Video compression; Video signal processing; CUDA.; Discrete wavelet transform; graphics hardware; wavelet lifting;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2010.143
Filename :
5539768
Link To Document :
بازگشت