DocumentCode
2548869
Title
A Simulation Framework to Automatically Analyze the Communication-Computation Overlap in Scientific Applications
Author
Subotic, Vladimir ; Sancho, Jose Carlos ; Labarta, Jesus ; Valero, Mateo
Author_Institution
Barcelona Supercomput. Center, Barcelona, Spain
fYear
2010
fDate
20-24 Sept. 2010
Firstpage
275
Lastpage
283
Abstract
Overlapping communication and computation has been devised as an attractive technique to alleviate the huge application´s network requirements at large scale. Overlapping will allow to fully or partially hide the long communication delays suffered when transferring messages through the network. This will relax the application´s network requirements, and hence allow to deploy more cost-effective network designs. However, today´s scientific applications make little use of overlapping. In addition, there is no support to analyze how overlap could impact the performance of real scientific applications. In this paper we address this issue by presenting a simulation framework to automatically analyze the benefits of communication-computation overlap. The simulation framework consists of a binary translation tool (Valgrind), a distributed machine simulator (Dimemas), and a visualization tool (Paraver). Valgrind instruments the legacy MPI application and generates the execution traces, then Dimemas uses the obtained traces and reconstructs the application´s time-behavior on a configurable parallel platform, and finally Paraver visualizes the obtained time-behaviors. Our simulation methodology brings two new features into the study of overlap: 1) automatic simulation of the overlapped execution - as there is no need for code restructuring in applications; and 2) visualization of simulated time behaviors, that further allows useful comparisons of the non-overlapped and the overlapped executions.
Keywords
application program interfaces; message passing; MPI application; binary translation tool; code restructuring; communication delays; communication-computation overlap; configurable parallel platform; cost-effective network designs; distributed machine simulator; network requirements; overlapping communication; scientific applications; simulation framework; visualization tool; Analytical models; Bandwidth; Buffer storage; Computational modeling; Delay; Production; Receivers; Communication-Computation Overlap; MPI;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2010 IEEE International Conference on
Conference_Location
Heraklion, Crete
Print_ISBN
978-1-4244-8373-0
Electronic_ISBN
978-0-7695-4220-1
Type
conf
DOI
10.1109/CLUSTER.2010.33
Filename
5600298
Link To Document