Title :
FFT-based matching pursuit implementation on CUDA platform
Author :
Li, Song ; Yin, Zhongke ; Wang, Jianying
Author_Institution :
School of Information Science and Technology, Southwest Jiaotong University, Chengdu, China
Abstract :
Matching pursuit adaptively decomposes signals in a redundant dictionary to achieve some sub-optimal non-orthogonal sparse representations. However, due to the redundancy of the dictionary, MP is usually very time consuming. FFT-based MP implementation runs significantly faster than greedy MP implementation, yet it still may take days to decompose an image on some dictionaries with high redundancy. This paper presents an implementation of FFT-based matching pursuit algorithm on CUDA platform for sparse decomposition of images. We found that FFT based MP presents strong data parallelism, thus suited for implementing on CUDA platform and executed in a parallel way on CUDA-capable GPU devices. Experiments results show that several dozen times of speedup ratio can be easily achieved.
Keywords :
Arrays; Convolution; Dictionaries; Graphics processing unit; Indexes; Kernel; Matching pursuit algorithms; CUDA; FFT; Matching pursuit; sparse decomposition;
Conference_Titel :
Information Science and Engineering (ICISE), 2010 2nd International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4244-7616-9
DOI :
10.1109/ICISE.2010.5689455