• DocumentCode
    294881
  • Title

    Comparison of 2-D FFT implementations on the Intel Paragon massively parallel supercomputer

  • Author

    An, M. ; Anupindi, N. ; Bletsas, M. ; Kechriotis, G. ; Lu, C. ; Manolakos, E.S. ; Tolimieri, R.

  • Author_Institution
    Aware Inc., Cambridge, MA, USA
  • Volume
    4
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    2755
  • Abstract
    We discuss the parallel implementation of multidimensional FFTs on distributed memory multiprocessor machines. We introduce a compact notation to describe four equivalent parallel algorithms and discuss their advantages and disadvantages. Two algorithms, suitable for the case when initial and final data are distributed either row- or column-wise, the traditional row-column (RC) and a variation of the vector radix (VR) that we call partial vector radix (PVR) are presented and their efficiency on the Paragon is compared. It is shown that the PVR, although it requires larger amount of interprocessor communication, results in more efficient implementations due to the regularity of local and distributed memory accesses. For the case in which data are partitioned along both dimensions, two suitable parallel algorithms, the collect-distribute (CD) and the general full vector-radix (FVR), are presented. Again, it is shown that regularity in memory accesses for the case of the FVR, results in more efficient implementations
  • Keywords
    distributed memory systems; fast Fourier transforms; parallel algorithms; parallel machines; signal processing; 2-D FFT; Intel Paragon massively parallel supercomputer; Paragon; access memory regularity; collect-distribute algorithm; column-wise data; distributed memory access; distributed memory multiprocessor machines; full vector-radix algorithm; interprocessor communication; local memory access; multidimensional FFT; parallel algorithms; partial vector radix; row-wise data; signal processing; Computer science; Digital signal processing; Discrete Fourier transforms; Drives; Flexible printed circuits; Military computing; Multidimensional systems; Parallel algorithms; Signal processing algorithms; Supercomputers; Virtual reality;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.480132
  • Filename
    480132