مرکز منطقه ای اطلاع رساني علوم و فناوري - Compilation scheme for near fine grain parallel processing on a multiprocessor system without explicit synchronization

DocumentCode :

2846685

Title :

Compilation scheme for near fine grain parallel processing on a multiprocessor system without explicit synchronization

Author :

Ogata, Wataru ; Fujimoto, Kensaku ; Oota, Masato ; Kasahara, Hironori

Author_Institution :

Dept. of Electr. Eng., Waseda Univ., Tokyo, Japan

fYear :

1995

fDate :

17-19 May 1995

Firstpage :

327

Lastpage :

332

Abstract :

In Fortran parallelizing compilers for multiprocessor systems, a loop parallelizing scheme has been used. However, there still exist loops to which the Do-all and Do-across techniques cannot be effectively applied because of loop carried dependence and conditional branches to the outside of the loops. Also, the compiler do not exploit the parallelism of the subroutines, loops and basic blocks and the near-fine-grain parallelism inside the basic blocks in the outside of loops or in sequential loops. Therefore, it is important to use coarse-grain parallelism and near-fine-grain parallelism in addition to loop parallelization. Taking into consideration the above facts, the authors propose a multigrain parallel processing scheme which combines coarse-grain parallel processing or macro-data flow processing, loop concurrency, and a near-fine-grain parallel processing hierarchy. To minimize the data transfer overhead and the total processing time, the proposed compilation scheme uses a static scheduling algorithm called CP/DT/MISF (critical path/data transfer/most immediate successors first). Also, to minimize the synchronization overhead, the compilation scheme eliminates all synchronization codes by using machine-clock level precise code scheduling for a target multiprocessor system OSCAR. This scheme has been implemented on OSCAR and a performance evaluation on OSCAR shows the proposed near-fine-grain parallel processing without synchronization reduces the processing time of test programs by 30% to 40% compared with conventional near-fine-grain parallel processing with synchronization codes

Keywords :

FORTRAN; data flow computing; multiprocessing systems; parallel architectures; parallelising compilers; performance evaluation; processor scheduling; CP/DT/MISF; Fortran parallelizing compilers; OSCAR; coarse-grain parallel processing; critical path/data transfer/most immediate successors first; data transfer overhead; loop concurrency; loop parallelization; machine-clock level precise code scheduling; macro-data flow processing; multigrain parallel processing; multiprocessor system; multiprocessor systems; near fine grain parallel processing; performance evaluation; sequential loops; static scheduling algorithm; subroutines; synchronization overhead; total processing time; Hardware; Multiprocessing systems; Parallel processing; Processor scheduling; Scheduling algorithm; Testing; Timing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Communications, Computers, and Signal Processing, 1995. Proceedings., IEEE Pacific Rim Conference on

Conference_Location :

Victoria, BC

Print_ISBN :

0-7803-2553-2

Type :

conf

DOI :

10.1109/PACRIM.1995.519536

Filename :

519536

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2846685