• DocumentCode
    580129
  • Title

    A 32x32x32, spatially distributed 3D FFT in four microseconds on Anton

  • Author

    Young, Cliff ; Bank, Joseph A. ; Dror, Ron O. ; Grossman, J.P. ; Salmon, John K. ; Shaw, David E.

  • Author_Institution
    D.E. Shaw Res., New York, NY, USA
  • fYear
    2009
  • fDate
    14-20 Nov. 2009
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    Anton, a massively parallel special-purpose machine for molecular dynamics simulations, performs a 32 × 32 × 32 FFT in 3.7 microseconds and a 64 × 64 × 64 FFT in 13.3 microseconds on a configuration with 512 nodes-an order of magnitude faster than all other FFT implementations of which we are aware. Achieving this FFT performance requires a coordinated combination of computation and communication techniques that leverage Anton´s underlying hardware mechanisms. Most significantly, Anton´s communication subsystem provides over 300 gigabits per second of bandwidth per node, message latency in the hundreds of nanoseconds, and support for word-level writes and single-ended communication. In addition, Anton´s general-purpose computation system incorporates primitives that support the efficient parallelization of small 1D FFTs. Although Anton was designed specifically for molecular dynamics simulations, a number of the hardware primitives and software implementation techniques described in this paper may also be applicable to the acceleration of FFTs on general-purpose high-performance machines.
  • Keywords
    chemistry computing; fast Fourier transforms; molecular dynamics method; parallel machines; Anton communication subsystem; Anton general-purpose computation system; FFT performance; communication technique; general-purpose high-performance machine; hardware primitive; leverage Anton underlying hardware mechanism; massively parallel special-purpose machine; message latency; molecular dynamics simulation; parallelization; single-ended communication; software implementation technique; spatially distributed 3D FFT; word-level write;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
  • Conference_Location
    Portland, OR
  • Type

    conf

  • DOI
    10.1145/1654059.1654083
  • Filename
    6375578