• DocumentCode
    3200487
  • Title

    Filtering, Reductions and Synchronization in the Anton 2 Network

  • Author

    Grossman, J.P. ; Towles, Brian ; Greskamp, Brian ; Shaw, David E.

  • Author_Institution
    D.E. Shaw Res., New York, NY, USA
  • fYear
    2015
  • fDate
    25-29 May 2015
  • Firstpage
    860
  • Lastpage
    870
  • Abstract
    Parallel implementations of molecular dynamics (MD) simulation require significant inter-node communication, but off-chip communication bandwidth is not scaling as quickly as on-chip logic density. We present three network features targeting this problem that have been implemented in Anton 2, a massively parallel special-purpose supercomputer for MD simulations. The first is a mechanism to dynamically identify packets that do not need to be delivered to all endpoints within a multicast tree, these packets are filtered to conserve network bandwidth. The second is hardware for in-network reductions that supports over a thousand concurrent neighbourhood reductions per node and fast all-to-all global reductions. The third is a log-weight synchronization mechanism for multicast-reduce communication patterns that can be used to efficiently detect the completion of reduction operations when the number of summands is difficult to predict. We use the combination of packet filtering, in-network reductions and log-weight synchronization to decrease the communication requirements of MD simulations by as much as 51% on Anton 2, yielding application-level performance improvements of up to 14%.
  • Keywords
    biology computing; concurrency control; molecular biophysics; parallel processing; trees (mathematics); Anton 2 network; MD simulations; all-to-all global reductions; concurrent neighbourhood reductions; in-network reductions; internode communication; log-weight synchronization mechanism; molecular dynamics simulation; multicast tree; multicast-reduce communication patterns; network bandwidth conservation; off-chip communication bandwidth; on-chip logic density; packet filtering; packet identification; parallel implementations; parallel special-purpose supercomputer; Atomic measurements; Computational modeling; Filtering; Force; Hardware; Synchronization; System-on-chip; Anton; In-network reductions; Molecular Dynamics; Multicast; Synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
  • Conference_Location
    Hyderabad
  • ISSN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2015.42
  • Filename
    7161572