DocumentCode :
580129
Title :
A 32x32x32, spatially distributed 3D FFT in four microseconds on Anton
Author :
Young, Cliff ; Bank, Joseph A. ; Dror, Ron O. ; Grossman, J.P. ; Salmon, John K. ; Shaw, David E.
Author_Institution :
D.E. Shaw Res., New York, NY, USA
fYear :
2009
fDate :
14-20 Nov. 2009
Firstpage :
1
Lastpage :
11
Abstract :
Anton, a massively parallel special-purpose machine for molecular dynamics simulations, performs a 32 × 32 × 32 FFT in 3.7 microseconds and a 64 × 64 × 64 FFT in 13.3 microseconds on a configuration with 512 nodes-an order of magnitude faster than all other FFT implementations of which we are aware. Achieving this FFT performance requires a coordinated combination of computation and communication techniques that leverage Anton´s underlying hardware mechanisms. Most significantly, Anton´s communication subsystem provides over 300 gigabits per second of bandwidth per node, message latency in the hundreds of nanoseconds, and support for word-level writes and single-ended communication. In addition, Anton´s general-purpose computation system incorporates primitives that support the efficient parallelization of small 1D FFTs. Although Anton was designed specifically for molecular dynamics simulations, a number of the hardware primitives and software implementation techniques described in this paper may also be applicable to the acceleration of FFTs on general-purpose high-performance machines.
Keywords :
chemistry computing; fast Fourier transforms; molecular dynamics method; parallel machines; Anton communication subsystem; Anton general-purpose computation system; FFT performance; communication technique; general-purpose high-performance machine; hardware primitive; leverage Anton underlying hardware mechanism; massively parallel special-purpose machine; message latency; molecular dynamics simulation; parallelization; single-ended communication; software implementation technique; spatially distributed 3D FFT; word-level write;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
Conference_Location :
Portland, OR
Type :
conf
DOI :
10.1145/1654059.1654083
Filename :
6375578
Link To Document :
بازگشت