DocumentCode :
239534
Title :
Using massively parallel simulation for MPI collective communication modeling in extreme-scale networks
Author :
Mubarak, Misbah ; Carothers, Christopher D. ; Ross, Robert B. ; Carns, Philip
Author_Institution :
Comput. Sci. Dept., Rensselaer Polytech. Inst., Troy, NY, USA
fYear :
2014
fDate :
7-10 Dec. 2014
Firstpage :
3107
Lastpage :
3118
Abstract :
MPI collective operations are a critical and frequently used part of most MPI-based large-scale scientific applications. In previous work, we enabled the Rensselaer Optimistic Simulation System (ROSS) to predict the performance of MPI point-to-point messaging on high-fidelity million-node network simulations of torus and dragonfly interconnects. The main contribution of this work is an extension of these torus and dragonfly network models to support MPI collective communication operations using the optimistic event scheduling capability of ROSS. We demonstrate that both small- and large-scale ROSS collective communication models can execute efficiently on massively parallel architectures. We validate the results of our collective communication model against measurements from IBM Blue Gene/Q and Cray XC30 platforms using a data-driven approach on our network simulations. We also perform experiments to explore the impact of tree degree on the performance of collective communication operations in large-scale network models.
Keywords :
application program interfaces; message passing; Cray XC30 platforms; IBM Blue Gene/Q; MPI collective communication modeling; MPI collective communication operations; MPI collective operations; MPI point-to-point messaging; MPI-based large scale scientific applications; ROSS collective communication models; Rensselaer optimistic simulation system; dragonfly network models; extreme scale networks; high fidelity million-node network simulations; large scale network models; massively parallel architectures; massively parallel simulation; optimistic event scheduling capability; Bandwidth; Computational modeling; Computer architecture; Network topology; Predictive models; Synchronization; Topology;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Simulation Conference (WSC), 2014 Winter
Conference_Location :
Savannah, GA
Print_ISBN :
978-1-4799-7484-9
Type :
conf
DOI :
10.1109/WSC.2014.7020148
Filename :
7020148