DocumentCode
2061429
Title
High-performance parallel FFT algorithms for the HITACHI SR8000
Author
Takahashi, Daisuke
Author_Institution
Inf. Technol. Center, Tokyo Univ., Japan
Volume
1
fYear
2000
fDate
14-17 May 2000
Firstpage
192
Abstract
We propose high-performance parallel one-dimensional fast Fourier transform (FFT) algorithms for distributed memory parallel computers with vector symmetric multiprocessor (SMP) nodes. The four-step FFT algorithm can be altered into a five-step FFT algorithm to expand the innermost loop length. We use the four-step and five-step algorithms to implement the parallel one-dimensional FFT algorithms. In our proposed parallel FFT algorithms, since we use cyclic distribution, all-to-all communication takes place only once. Moreover, the input data and output data are both in natural order. Performance results of one-dimensional power-of-two FFTs on a distributed memory parallel computer with (pseudo) vector SMP nodes, HITACHI SR8000, are reported. We succeeded in obtaining performance of about 38 GFLOPS on a 16-node SR8000.
Keywords
distributed memory systems; fast Fourier transforms; mathematics computing; parallel algorithms; parallel machines; performance evaluation; 38 GFLOPS; HITACHI SR8000; all-to-all communication; cyclic distribution; distributed memory parallel computers; five-step algorithms; four-step algorithms; high-performance parallel algorithms; parallel fast Fourier transform algorithms; vector symmetric multiprocessor nodes;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing in the Asia-Pacific Region, 2000. Proceedings. The Fourth International Conference/Exhibition on
Conference_Location
Beijing, China
Print_ISBN
0-7695-0589-2
Type
conf
DOI
10.1109/HPC.2000.846545
Filename
846545
Link To Document