Title of article :
Efficient implementation of parallel three-dimensional FFT on clusters of PCs Original Research Article
Author/Authors :
DAISUKE TAKAHASHI، نويسنده ,
Issue Information :
دوهفته نامه با شماره پیاپی سال 2003
Pages :
7
From page :
144
To page :
150
Abstract :
In this paper, we propose a high-performance parallel three-dimensional fast Fourier transform (FFT) algorithm on clusters of PCs. The three-dimensional FFT algorithm can be altered into a block three-dimensional FFT algorithm to reduce the number of cache misses. We show that the block three-dimensional FFT algorithm improves performance by utilizing the cache memory effectively. We use the block three-dimensional FFT algorithm to implement the parallel three-dimensional FFT algorithm. We succeeded in obtaining performance of over 1.3 GFLOPS on an 8-node dual Pentium III 1 GHz PC SMP cluster.
Keywords :
Fast Fourier Transform , Clusters of PCs , Cache memory , All-to-all communication , Cyclic distribution
Journal title :
Computer Physics Communications
Serial Year :
2003
Journal title :
Computer Physics Communications
Record number :
1136151
Link To Document :
بازگشت