Title :
High-performance parallel FFT algorithms for the HITACHI SR8000
Author :
Takahashi, Daisuke
Author_Institution :
Inf. Technol. Center, Tokyo Univ., Japan
Abstract :
We propose high-performance parallel one-dimensional fast Fourier transform (FFT) algorithms for distributed memory parallel computers with vector symmetric multiprocessor (SMP) nodes. The four-step FFT algorithm can be altered into a five-step FFT algorithm to expand the innermost loop length. We use the four-step and five-step algorithms to implement the parallel one-dimensional FFT algorithms. In our proposed parallel FFT algorithms, since we use cyclic distribution, all-to-all communication takes place only once. Moreover, the input data and output data are both in natural order. Performance results of one-dimensional power-of-two FFTs on a distributed memory parallel computer with (pseudo) vector SMP nodes, HITACHI SR8000, are reported. We succeeded in obtaining performance of about 38 GFLOPS on a 16-node SR8000.
Keywords :
distributed memory systems; fast Fourier transforms; mathematics computing; parallel algorithms; parallel machines; performance evaluation; 38 GFLOPS; HITACHI SR8000; all-to-all communication; cyclic distribution; distributed memory parallel computers; five-step algorithms; four-step algorithms; high-performance parallel algorithms; parallel fast Fourier transform algorithms; vector symmetric multiprocessor nodes;
Conference_Titel :
High Performance Computing in the Asia-Pacific Region, 2000. Proceedings. The Fourth International Conference/Exhibition on
Conference_Location :
Beijing, China
Print_ISBN :
0-7695-0589-2
DOI :
10.1109/HPC.2000.846545