Title :
Extending collective operations with application semantics for improving multi-cluster performance
Author :
Bongo, Lars Ailo ; Anshus, Otto ; Bjørndalen, John Markus ; Larsen, Tore
Author_Institution :
Dept. of Comput. Sci., Tromso Univ., Norway
Abstract :
We identify two ways of increasing the performance of allreduce-style collective operations in a multi-cluster with large WAN latencies: (i) hiding latency in system noise, and (ii) conditional-allreduce, where knowledge about the application is used to reduce the number of WAN messages. In our multi-cluster, system noise was not large enough to hide the WAN latency. However, the latency could be hidden using conditional-allreduce, since on many iterations only cluster-local values were needed, and many of the values needed from other clusters were prefetched. A speedup of 2.4 was achieved for a microbenchmark. Prefetching introduced a small overhead in the cluster with the slowest hosts.
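The conditional-allreduce idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is a pure-Python simulation with no MPI. The function name `conditional_allreduce`, the `need_global` flag (standing in for whatever application knowledge decides that remote values are required), and the message-count model (each non-root cluster sends one partial result over the WAN and receives one result back) are all assumptions made for illustration.

```python
from typing import List, Tuple


def conditional_allreduce(
    cluster_values: List[List[float]], need_global: bool
) -> Tuple[List[float], int]:
    """Simulate one allreduce (sum) step over several clusters.

    Hypothetical model, not the paper's code. Returns the value seen by
    each cluster and the number of WAN messages the step would cost.
    """
    # Step 1: reduce inside each cluster. This uses only the local network.
    local = [sum(values) for values in cluster_values]

    # Step 2: if the application only needs cluster-local values on this
    # iteration, skip the wide-area exchange entirely.
    if not need_global:
        return local, 0

    # Step 3: otherwise combine the partial results across clusters.
    # Assumed cost: one message to a root cluster and one back per
    # non-root cluster.
    total = sum(local)
    wan_messages = 2 * (len(cluster_values) - 1)
    return [total] * len(cluster_values), wan_messages


def count_wan_messages(
    cluster_values: List[List[float]], global_needed: List[bool]
) -> int:
    """Total WAN messages over a run, one flag per iteration."""
    return sum(
        conditional_allreduce(cluster_values, flag)[1] for flag in global_needed
    )
```

Under these assumptions, a run in which only a few iterations set `need_global` sends correspondingly fewer WAN messages than a run that performs a full allreduce on every iteration, which is the effect the abstract attributes to conditional-allreduce. Prefetching of remote values is not modelled here.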
Keywords :
application program interfaces; grid computing; message passing; performance evaluation; storage management; wide area networks; workstation clusters; MPI; WAN latency; WAN messages; allreduce-style collective operations; application knowledge; application semantics; cluster-local values; computational grids; conditional-allreduce; latency hiding; message passing interface; microbenchmark; multicluster performance; prefetching; system noise; Application software; Bandwidth; Broadcasting; Clustering algorithms; Delay; Grid computing; Iterative algorithms; Prefetching; Wide area networks; Yarn;
Conference_Title :
Parallel and Distributed Computing, 2004. Third International Symposium on/Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks, 2004. Third International Workshop on
Print_ISBN :
0-7695-2210-6
DOI :
10.1109/ISPDC.2004.24