DocumentCode :
2484998
Title :
A super-efficient adaptable bit-reversal algorithm for multithreaded architectures
Author :
Elster, Anne C. ; Meyer, Jan C.
Author_Institution :
Dept. of Comput. & Inf. Sci., Norwegian Univ. of Sci. & Technol., Trondheim, Norway
fYear :
2009
fDate :
23-29 May 2009
Firstpage :
1
Lastpage :
8
Abstract :
Fast bit-reversal algorithms have been of strong interest for many decades, especially after Cooley and Tukey introduced their FFT implementation in 1965. Many recent algorithms, including FFTW try to avoid the bit-reversal all together by doing in-place algorithms within their FFTs. We therefore motivate our work by showing that for FFTs of up to 65.536 points, a minimally tuned Cooley-Tukey FFT in C using our bit-reversal algorithm performs comparable or better than the default FFTW algorithm. In this paper, we present an extremely fast linear bit-reversal adapted for modern multithreaded architectures. Our bit-reversal algorithm takes advantage of recursive calls combined with the fact that it only generates pairs of indices for which the corresponding elements need to be exchanges, thereby avoiding any explicit tests. In addition we have implemented an adaptive approach which explores the trade-off between compile time and run-time work load. By generating look-up tables at compile time, our algorithm becomes even faster at run-time. Our results also show that by using more than one thread on tightly coupled architectures, further speed-up can be achieved.
Keywords :
fast Fourier transforms; multi-threading; FFT; bit-reversal algorithm; fast Fourier transform; multithreaded architecture; Arithmetic; Computer architecture; Fast Fourier transforms; Flexible printed circuits; Information science; Performance evaluation; Runtime; Sorting; Testing; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
Conference_Location :
Rome
ISSN :
1530-2075
Print_ISBN :
978-1-4244-3751-1
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2009.5161105
Filename :
5161105
Link To Document :
بازگشت