DocumentCode
3459686
Title
Performance Portable Optimizations for Loops Containing Communication Operations
Author
Iancu, Costin ; Chen, Wei ; Yelick, Katherine
Author_Institution
Lawrence Berkeley Nat. Lab., Berkeley
fYear
2007
fDate
15-19 Sept. 2007
Firstpage
411
Lastpage
411
Abstract
As high end computing systems continue to scale in CPU computational power and overall node count, optimization techniques that can reduce communication overhead have proven important. We present a loop optimization framework designed to achieve both efficient communication/computation overlap and performance portability. The framework has been implemented in the Berkeley UPC compiler and uses a combination of compile time analysis and runtime mechanisms. We extend the compiler to perform message vectorization and message strip mining optimizations. At compile time loop nests are analyzed, their communication requirements are determined, and the computation overhead is estimated. The compiler passes analysis information to the runtime and performance portability is achieved by decoupling data movement from local computation. We generate template code that uses the transferred data without making any assumptions about the communication mechanism.
Keywords
optimising compilers; program control structures; communication operations; compile time analysis; message strip mining optimizations; message vectorization; portable loop optimization framework; runtime mechanisms; Communication system control; High performance computing; Laboratories; Optimizing compilers; Performance analysis; Portable computers; Productivity; Programming profession; Quality of service; Runtime;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Architecture and Compilation Techniques, 2007. PACT 2007. 16th International Conference on
Conference_Location
Brasov
ISSN
1089-795X
Print_ISBN
978-0-7695-2944-8
Type
conf
DOI
10.1109/PACT.2007.4336239
Filename
4336239
Link To Document