DocumentCode :
1961237
Title :
Two dynamic performance tuning methods for portable parallel programs
Author :
Suzaki, Kuniyasu ; Kurita, Takio ; Tanuma, Hitoshi ; Hirano, Satoshi ; Ichisugi, Yuuji
Author_Institution :
Electrotech. Lab., Tsukuba, Japan
Volume :
2
fYear :
1995
fDate :
19-21 Apr 1995
Firstpage :
585
Abstract :
We present two dynamic performance tuning methods for portable parallel programs on various parallel computers. In parallel programs the affinity between parallel algorithms and the architecture of the target parallel computer is very important. In this paper we focus on the parallelism in view of the number of micro-tasks which are processing units in parallel programs. The presented methods estimate the optimal number of micro-tasks before the parallel processing is invoked. Furthermore, they shorten the execution time of the parallel program so that it is close to the optimal execution time. The estimation is based on the result of pre-executions of the program for different sizes of the data to be processed on a target parallel computer. One tuning method uses nearest-neighbor interpolation and the other uses spline interpolation for the estimation. We tested these tuning methods using a parallel square-matrix multiplication program written in Dataparallel C on three different parallel computers; a Paragon, an iPSC/2, and an nCUBE/2. In these experiments, the method using nearest-neighbor interpolation brought the execution time closer to the optimum than did the method using spline interpolation. The nearest-neighbor interpolation method yielded average execution times, which are given in terms of the optimal execution time, of 1.01 for the Paragon, 1.005 for the iPSC/2, and 1.052 for the nCUBE/2
Keywords :
computational complexity; parallel programming; software performance evaluation; software portability; dynamic performance tuning methods; micro-tasks; nearest-neighbor interpolation; optimal execution time; parallel algorithms; parallel computers; parallel square-matrix multiplication program; portable parallel programs; spline interpolation; Computer architecture; Concurrent computing; Delay; Laboratories; Optimizing compilers; Parallel algorithms; Parallel processing; Parallel programming; Portable computers; Spline;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Algorithms and Architectures for Parallel Processing, 1995. ICAPP 95. IEEE First ICA/sup 3/PP., IEEE First International Conference on
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-7803-2018-2
Type :
conf
DOI :
10.1109/ICAPP.1995.472244
Filename :
472244
Link To Document :
بازگشت