DocumentCode
506180
Title
FFTs in external or hierarchical memory
Author
Bailey, David H.
Author_Institution
NASA Ames Research Center, Moffett Field, CA
fYear
1989
fDate
12-17 Nov. 1989
Firstpage
234
Lastpage
242
Abstract
Conventional algorithms for computing large one-dimensional fast Fourier transforms (FFTs), even those algorithms recently developed for vector and parallel computers, are largely unsuitable for systems with external or hierarchical memory. The principal reason for this is the fact that most FFT algorithms require at least m complete passes through the data set to compute a 2m-point FFT. This paper describes some advanced techniques for computing an ordered FFT on a computer with external or hierarchical memory. These algorithms (1) require as few as two passes through the external data set, (2) employ strictly unit stride, long vector transfers between main memory and external storage, (3) require only a modest amount of scratch space in main memory, and (4) are well suited for vector and parallel computation. Performance figures are included for implementations of some of these algorithms on Cray supercomputers. Of interest is the fact that a main memory version outperforms the current Cray library FFT routines on the Cray-2, the Cray X-MP, and the Cray Y-MP systems. Using all eight processors on the Cray Y-MP, this main memory routine runs at nearly two gigaflops.
Keywords
Algorithm design and analysis; Concurrent computing; Distributed computing; Fast Fourier transforms; Flexible printed circuits; Frequency; Libraries; Machinery; NASA; Permission;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, 1989. Supercomputing '89. Proceedings of the 1989 ACM/IEEE Conference on
Conference_Location
Reno, NV, USA
Print_ISBN
0-89791-341-8
Type
conf
DOI
10.1145/76263.76288
Filename
5349015
Link To Document