DocumentCode :
2523909
Title :
Algorithms for Mapping Parallel Processes onto Grid and Torus Architectures
Author :
Glantz, Roland ; Meyerhenke, Hening ; Noe, Alexander
Author_Institution :
Inst. of Theor. Inf., Karlsruhe Inst. of Technol. (KIT), Karlsruhe, Germany
fYear :
2015
fDate :
4-6 March 2015
Firstpage :
236
Lastpage :
243
Abstract :
Static mapping is the assignment of parallel processes to the processing elements (PEs) of a parallel system, where the assignment does not change during the application´s lifetime. In our scenario we model an application´s computations and their dependencies by an application graph. This graph is first partitioned into (nearly) equally sized blocks. These blocks need to communicate at block boundaries. To assign the processes to PEs, our goal is to compute a communication-efficient bijective mapping between the blocks and the PEs. This approach of partitioning followed by bijective mapping has many degrees of freedom. Thus, users and developers of parallel applications need to know more about which choices work for which application graphs and which parallel architectures. To this end, we not only develop new mapping algorithms (derived from known greedy methods). We also perform extensive experiments involving different classes of application graphs (meshes and complex networks), architectures of parallel computers (grids and tori), as well as different partitioners and mapping algorithms. Surprisingly, the quality of the partitions, unless very poor, has little influence on the quality of the mapping. More importantly, one of our new mapping algorithms always yields the best results in terms of the quality measure maximum congestion when the application graphs are complex networks. In case of meshes as application graphs, this mapping algorithm always leads in terms of maximum congestion AND maximum dilation, another common quality measure.
Keywords :
graph theory; parallel algorithms; parallel architectures; application graphs; communication-efficient bijective mapping; greedy method; grid architecture; maximum congestion; maximum dilation; parallel architectures; parallel process mapping; parallel system; static mapping; torus architecture; Complex networks; Computational modeling; Computer architecture; Computers; Informatics; Partitioning algorithms; Topology; Topology mapping; grids tori; new variant of greedy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2015 23rd Euromicro International Conference on
Conference_Location :
Turku
ISSN :
1066-6192
Type :
conf
DOI :
10.1109/PDP.2015.21
Filename :
7092726
Link To Document :
بازگشت