DocumentCode :
402632
Title :
Mobile and replicated alignment of arrays in data-parallel programs
Author :
Chatterjee, Siddhartha ; Gilbert, John R. ; Schreiber, Robert
Author_Institution :
NASA Ames Res. Center, Moffett Field, CA, USA
fYear :
1993
fDate :
15-19 Nov. 1993
Firstpage :
420
Lastpage :
429
Abstract :
When a data-parallel language like Fortran 90 is compiled for a distributed-memory machine, aggregate data objects (such as arrays) are distributed across the processor memories. The mapping determines the amount of residual communication needed to bring operands of parallel operations into alignment with each other. A common approach is to break the mapping into two stages: first, an alignment that maps all the objects to an abstract template, and then a distribution that maps the template to the processors. The authors solve two facets of the problem of finding alignments that reduce residual communication, i.e., determining both the alignments that vary in loops, and the objects that should have replicated alignments. They show that loop-dependent mobile alignment is sometimes necessary for optimum performance, and they provide algorithms with which a compiler can determine good mobile alignments for objects within do loops. They also identify situations in which replicated alignment is either required by the program itself (via spread operations) or can be used to improve performance. An algorithm based on network flow that determines which objects to replicate so as to minimize the total amount of broadcast communication in replication is proposed.
Keywords :
distributed memory systems; parallel algorithms; parallel processing; parallel programming; program compilers; software performance evaluation; Fortran 90; abstract template; aggregate data objects; arrays; broadcast communication; compiler; data-parallel language; data-parallel programs; distributed-memory machine; loop-dependent mobile alignment; network flow; optimum performance; parallel algorithms; parallel operations; processor memories; replicated alignment; residual communication; spread operations; Aggregates; Automatic control; Broadcasting; Cost function; Educational institutions; Mobile communication; NASA; Parallel processing; Phased arrays; Postal services;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing '93. Proceedings
ISSN :
1063-9535
Print_ISBN :
0-8186-4340-4
Type :
conf
DOI :
10.1109/SUPERC.1993.1263489
Filename :
1263489
Link To Document :
بازگشت