DocumentCode :
2846685
Title :
Compilation scheme for near fine grain parallel processing on a multiprocessor system without explicit synchronization
Author :
Ogata, Wataru ; Fujimoto, Kensaku ; Oota, Masato ; Kasahara, Hironori
Author_Institution :
Dept. of Electr. Eng., Waseda Univ., Tokyo, Japan
fYear :
1995
fDate :
17-19 May 1995
Firstpage :
327
Lastpage :
332
Abstract :
In Fortran parallelizing compilers for multiprocessor systems, a loop parallelizing scheme has been used. However, there still exist loops to which the Do-all and Do-across techniques cannot be effectively applied because of loop carried dependence and conditional branches to the outside of the loops. Also, the compiler do not exploit the parallelism of the subroutines, loops and basic blocks and the near-fine-grain parallelism inside the basic blocks in the outside of loops or in sequential loops. Therefore, it is important to use coarse-grain parallelism and near-fine-grain parallelism in addition to loop parallelization. Taking into consideration the above facts, the authors propose a multigrain parallel processing scheme which combines coarse-grain parallel processing or macro-data flow processing, loop concurrency, and a near-fine-grain parallel processing hierarchy. To minimize the data transfer overhead and the total processing time, the proposed compilation scheme uses a static scheduling algorithm called CP/DT/MISF (critical path/data transfer/most immediate successors first). Also, to minimize the synchronization overhead, the compilation scheme eliminates all synchronization codes by using machine-clock level precise code scheduling for a target multiprocessor system OSCAR. This scheme has been implemented on OSCAR and a performance evaluation on OSCAR shows the proposed near-fine-grain parallel processing without synchronization reduces the processing time of test programs by 30% to 40% compared with conventional near-fine-grain parallel processing with synchronization codes
Keywords :
FORTRAN; data flow computing; multiprocessing systems; parallel architectures; parallelising compilers; performance evaluation; processor scheduling; CP/DT/MISF; Fortran parallelizing compilers; OSCAR; coarse-grain parallel processing; critical path/data transfer/most immediate successors first; data transfer overhead; loop concurrency; loop parallelization; machine-clock level precise code scheduling; macro-data flow processing; multigrain parallel processing; multiprocessor system; multiprocessor systems; near fine grain parallel processing; performance evaluation; sequential loops; static scheduling algorithm; subroutines; synchronization overhead; total processing time; Hardware; Multiprocessing systems; Parallel processing; Processor scheduling; Scheduling algorithm; Testing; Timing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, Computers, and Signal Processing, 1995. Proceedings., IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
0-7803-2553-2
Type :
conf
DOI :
10.1109/PACRIM.1995.519536
Filename :
519536
Link To Document :
بازگشت