DocumentCode :
3350679
Title :
Efficient parallelisation using combined loop and data transformations
Author :
O'Boyle, M.F.P. ; Knijnenburg, P.M.W.
Author_Institution :
Inst. for Comput. Syst. Archit., Edinburgh Univ., UK
fYear :
1999
fDate :
1999
Firstpage :
283
Lastpage :
291
Abstract :
The paper attempts to minimise parallelisation overhead on distributed shared memory machines, such as the SGI Origin 2000, by combining non-singular loop and data transformations. We show that conflicting requirements on a loop transformation may be resolved by using a data transformation and vice versa. We develop optimisation criteria for locality, synchronisation and communication and show that neither loop nor data transformations alone are sufficient for efficient parallelisation. This leads to the development of a novel global optimisation heuristic which is applied to three SPEC kernels, where it is shown to outperform techniques based solely on loop or data transformations and to give significant improvement over an existing state-of-the-art commercial auto-paralleliser.
Keywords :
automatic programming; distributed shared memory systems; optimising compilers; parallel programming; parallelising compilers; program control structures; SGI Origin 2000; SPEC kernels; combined loop/data transformations; conflicting requirements; data transformation; distributed shared memory machines; global optimisation heuristic; loop transformation; optimisation criteria; parallelisation overhead; state-of-the-art commercial auto-paralleliser; synchronisation; Computer architecture; Computer science; Context; Costs; Delay; Kernel; Parallel machines; Parallel processing; Program processors; Spatial resolution
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel Architectures and Compilation Techniques, 1999. Proceedings. 1999 International Conference on
Conference_Location :
Newport Beach, CA
ISSN :
1089-795X
Print_ISBN :
0-7695-0425-6
Type :
conf
DOI :
10.1109/PACT.1999.807573
Filename :
807573