DocumentCode
1084953
Title
Processor mapping techniques toward efficient data redistribution
Author
Kalns, Edgar T. ; Ni, Lionel M.
Author_Institution
POWERparallel Syst., IBM Corp., Poughkeepsie, NY, USA
Volume
6
Issue
12
fYear
1995
fDate
12/1/1995 12:00:00 AM
Firstpage
1234
Lastpage
1247
Abstract
Run-time data redistribution can enhance algorithm performance in distributed-memory machines. Explicit redistribution of data can be performed between algorithm phases when a different data decomposition is expected to deliver increased performance for a subsequent phase of computation. Redistribution, however, represents increased program overhead as algorithm computation is discontinued while data are exchanged among processor memories. In this paper, we present a technique that minimizes the amount of data exchange for BLOCK to CYCLIC(c) (or vice-versa) redistributions of arbitrary number of dimensions. Preserving the semantics of the target (destination) distribution pattern, the technique manipulates the data to logical processor mapping of the target pattern. When implemented on an IBM SP, the mapping technique demonstrates redistribution performance improvements of approximately 40% over traditional data to processor mapping. Relative to the traditional mapping technique, the proposed method affords greater flexibility in specifying precisely which data elements are redistributed and which elements remain on-processor
Keywords
FORTRAN; distributed memory systems; parallel languages; parallel programming; processor scheduling; High Performance Fortran; algorithm computation; algorithm performance; algorithm phases; data decomposition; data redistribution; data-parallel programming; distributed-memory architectures; distributed-memory machines; efficient data redistribution; logical processor mapping; processor mapping; processor mapping techniques; program overhead; redistribution performance improvements; run-time data redistribution; target pattern; traditional mapping technique; Computer Society; Computer science; Costs; Runtime; System performance;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/71.476166
Filename
476166
Link To Document