DocumentCode :
3200487
Title :
Filtering, Reductions and Synchronization in the Anton 2 Network
Author :
Grossman, J.P. ; Towles, Brian ; Greskamp, Brian ; Shaw, David E.
Author_Institution :
D.E. Shaw Res., New York, NY, USA
fYear :
2015
fDate :
25-29 May 2015
Firstpage :
860
Lastpage :
870
Abstract :
Parallel implementations of molecular dynamics (MD) simulation require significant inter-node communication, but off-chip communication bandwidth is not scaling as quickly as on-chip logic density. We present three network features targeting this problem that have been implemented in Anton 2, a massively parallel special-purpose supercomputer for MD simulations. The first is a mechanism to dynamically identify packets that do not need to be delivered to all endpoints within a multicast tree, these packets are filtered to conserve network bandwidth. The second is hardware for in-network reductions that supports over a thousand concurrent neighbourhood reductions per node and fast all-to-all global reductions. The third is a log-weight synchronization mechanism for multicast-reduce communication patterns that can be used to efficiently detect the completion of reduction operations when the number of summands is difficult to predict. We use the combination of packet filtering, in-network reductions and log-weight synchronization to decrease the communication requirements of MD simulations by as much as 51% on Anton 2, yielding application-level performance improvements of up to 14%.
Keywords :
biology computing; concurrency control; molecular biophysics; parallel processing; trees (mathematics); Anton 2 network; MD simulations; all-to-all global reductions; concurrent neighbourhood reductions; in-network reductions; internode communication; log-weight synchronization mechanism; molecular dynamics simulation; multicast tree; multicast-reduce communication patterns; network bandwidth conservation; off-chip communication bandwidth; on-chip logic density; packet filtering; packet identification; parallel implementations; parallel special-purpose supercomputer; Atomic measurements; Computational modeling; Filtering; Force; Hardware; Synchronization; System-on-chip; Anton; In-network reductions; Molecular Dynamics; Multicast; Synchronization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
Conference_Location :
Hyderabad
ISSN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2015.42
Filename :
7161572
Link To Document :
بازگشت