DocumentCode
2798763
Title
Transformations to Parallel Codes for Communication-Computation Overlap
Author
Danalis, Anthony ; Kim, Ki-Yong ; Pollock, Lori ; Swany, Martin
Author_Institution
University of Delaware
fYear
2005
fDate
12-18 Nov. 2005
Firstpage
58
Lastpage
58
Abstract
This paper presents program transformations directed toward improving communication-computation overlap in parallel programs that use MPI’s collective operations. Our transformations target a wide variety of applications focusing on scientific codes with computation loops that exhibit limited dependence among iterations. We include guidance for developers for transforming an application code in order to exploit the communicationcomputation overlap available in the underlying cluster, as well as a discussion of the performance improvements achieved by our transformations. We present results from a detailed study of the effect of the problem and message size, level of communication-computation overlap, and amount of communication aggregation on runtime performance in a cluster environment based on an RDMA-enabled network. The targets of our study are two scientific codes written by domain scientists, but the applicability of our work extends far beyond the scope of these two applications.
Keywords
Application software; Bandwidth; Concurrent computing; Delay; Parallel processing; Permission; Power engineering and energy; Runtime environment; Tiles; Workstations;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, 2005. Proceedings of the ACM/IEEE SC 2005 Conference
Print_ISBN
1-59593-061-2
Type
conf
DOI
10.1109/SC.2005.75
Filename
1560010
Link To Document