DocumentCode :
159143
Title :
Symbolic inner loop parallelisation for massively parallel processor arrays
Author :
Tanase, Alexandru ; Witterauf, Michael ; Teich, Jurgen ; Hannig, Frank
Author_Institution :
Dept. of Comput. Sci., Univ. of Erlangen-Nurnberg (FAU), Erlangen, Germany
fYear :
2014
fDate :
19-21 Oct. 2014
Firstpage :
219
Lastpage :
228
Abstract :
This paper presents a first solution to the unsolved problem of symbolically scheduling a given loop nest with uniform data dependences using inner loop parallelization, in particular, the locally parallel, globally sequential (LPGS) mapping technique. This technique is needed in the case of loop program specifications for which the iterations shall be scheduled on a processor array of unknown size at compile time while keeping the local memory consumption independent of the problem size of the mapped loop nest. We show that it is possible to derive such parameterized LPGS schedules statically by proposing a mixed compile-/runtime approach: At compile time, we first determine the set of all schedule candidates, each latency-optimal for a different scanning order of the loop nest. Then we devise an exact parameterized formula for determining the latency of the resulting symbolic schedules, thus making each schedule fully predictable. At runtime, once the size of the processor array becomes known, a simple prolog selects the overall latency-optimal schedule that is then dynamically activated and executed on the processor array. Hence, our approach avoids any further runtime optimization and expensive re-compilations while achieving the same results as computing an optimal static schedule for each possible combination of array and problem size.
Keywords :
formal specification; parallel processing; program compilers; LPGS mapping technique; data dependence; locally parallel globally sequential mapping technique; loop nest; loop program specifications; massively parallel processor arrays; mixed compile-/runtime approach; symbolic inner loop parallelisation; symbolic schedules; Finite impulse response filters; Optimal scheduling; Processor scheduling; Runtime; Schedules; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Formal Methods and Models for Codesign (MEMOCODE), 2014 Twelfth ACM/IEEE International Conference on
Conference_Location :
Lausanne
Type :
conf
DOI :
10.1109/MEMCOD.2014.6961865
Filename :
6961865
Link To Document :
بازگشت