Regional Information Center for Science and Technology - An efficient algorithm for the run-time parallelization of DOACROSS loops

DocumentCode :

2280301

Title :

An efficient algorithm for the run-time parallelization of DOACROSS loops

Author :

Chen, Ding-Kai ; Torrellas, Josep ; Yew, Pen-Chung

Author_Institution :

Center for Supercomputing Research and Development, University of Illinois, Urbana, IL, USA

fYear :

1994

fDate :

14-18 Nov 1994

Firstpage :

518

Lastpage :

527

Abstract :

While automatic parallelization of loops usually relies on compile time analysis of data dependences, for some loops the data dependences cannot be determined at compile time. An example is loops accessing arrays with subscripted subscripts. To parallelize these loops, it is necessary to perform run time analysis. We present a new algorithm to parallelize these loops at run time. Our scheme handles any type of data dependence in the loop without requiring any special architectural support in the multiprocessor. Furthermore, compared to an older scheme with the same generality, our scheme significantly reduces the amount of processor communication required and increases the overlap among dependent iterations. We evaluate our algorithm with parameterized loops running on the 32-processor Cedar shared memory multiprocessor. The results show speedups over the serial code of up to 14 with the full overhead of run time analysis and of up to 27 if part of the analysis is reused across loop invocations. Moreover, the algorithm outperforms the older scheme in nearly all cases, reaching speedups of up to times when the loop has many dependences.
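
To illustrate the kind of loop the abstract refers to, the following is a minimal sketch of run-time dependence analysis for a loop with subscripted subscripts, `a[w[i]] = a[r[i]] + 1`, where the index arrays `w` and `r` are only known at run time. It is a generic inspector/executor scheme with stage (wavefront) scheduling, not the algorithm presented in the paper, and every name in it (`inspect_stages`, `run_staged`, `run_serial`) is hypothetical. The inspector assigns each iteration a stage so that every flow, anti, and output dependence points to an earlier stage; iterations that share a stage are then mutually independent and could be run in parallel.

```python
def run_serial(a, w, r):
    """Reference: the original loop, executed in program order."""
    a = list(a)
    for i in range(len(w)):
        a[w[i]] = a[r[i]] + 1
    return a


def inspect_stages(w, r):
    """Inspector: give each iteration a stage number.

    An iteration is placed strictly after any earlier iteration it
    depends on (flow, output, or anti dependence on the same element).
    """
    last_write = {}  # element -> stage of its most recent writer
    last_read = {}   # element -> highest stage of any reader so far
    stages = []
    for wi, ri in zip(w, r):
        stage = last_write.get(ri, -1) + 1             # flow: read after write
        stage = max(stage, last_write.get(wi, -1) + 1)  # output: write after write
        stage = max(stage, last_read.get(wi, -1) + 1)   # anti: write after read
        stages.append(stage)
        last_write[wi] = stage
        last_read[ri] = max(last_read.get(ri, -1), stage)
    return stages


def run_staged(a, w, r, stages):
    """Executor: run stage by stage.

    Iterations inside one stage touch disjoint data, so their order
    does not matter; here they are simply run one after another.
    """
    a = list(a)
    buckets = [[] for _ in range(max(stages, default=-1) + 1)]
    for i, s in enumerate(stages):
        buckets[s].append(i)
    for bucket in buckets:
        for i in reversed(bucket):  # reversed on purpose: order is irrelevant
            a[w[i]] = a[r[i]] + 1
    return a
```

In this sketch the inspector is a sequential pass over the index arrays, and its result can be kept and reused whenever the loop is invoked again with the same `w` and `r`, which is the kind of reuse across loop invocations the abstract mentions.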

Keywords :

automatic programming; parallel algorithms; parallel programming; shared memory systems; 32-processor Cedar shared memory multiprocessor; DOACROSS loops; architectural support; automatic parallelization; compile time analysis; data dependences; dependent iterations; loop invocations; multiprocessor; parameterized loops; processor communication; run time analysis; run-time parallelization; serial code; subscripted subscripts; Computer graphics; Concurrent computing; Data analysis; NASA; Parallel processing; Performance analysis; Research and development; Runtime; Silicon; US Department of Energy;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Supercomputing '94, Proceedings

Conference_Location :

Washington, DC

Print_ISBN :

0-8186-6605-6

Type :

conf

DOI :

10.1109/SUPERC.1994.344315

Filename :

344315

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2280301