Title :
Bandwidth-efficient collective communication for clustered wide area systems
Author :
Kielmann, Thilo ; Bal, Henri E. ; Gorlatch, Sergei
Author_Institution :
Dept. of Math. & Comput. Sci., Vrije Univ., Amsterdam, Netherlands
Abstract :
Metacomputing infrastructures couple multiple clusters (or MPPs) via wide-area networks. A major problem in programming parallel applications for such platforms is their hierarchical network structure: latency and bandwidth of WANs are often orders of magnitude worse than those of local networks. Our goal is to optimize MPI's collective operations for such platforms. In this paper we focus on optimized utilization of the (scarce) wide-area bandwidth. We use two techniques: selecting suitable communication graph shapes, and splitting messages into multiple segments that are sent in parallel over different WAN links. To determine the best graph shape and segment size, we introduce a performance model called parameterized LogP (P-LogP), a hierarchical extension of the LogP model that covers messages of arbitrary length. With P-LogP, the optimal segment size and the best broadcast tree shape can be determined at runtime. (For conciseness, we restrict our discussion to the broadcast operation.) An experimental performance evaluation shows that the new broadcast has significantly improved performance (for large messages) and that there is a close match between the theoretical model and the measured completion times.
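Illustrative note (not part of the original record): the abstract describes choosing a segment size from a cost model at runtime. The sketch below shows the general idea only, under loudly stated assumptions. It is not the paper's P-LogP formulation. The gap function `gap`, its default overhead and bandwidth values, the chain-shaped pipeline of `depth` hops, and all function names are hypothetical and chosen for illustration.

```python
import math


def gap(size_bytes, per_msg_overhead=1e-3, bandwidth=1e6):
    """Hypothetical gap g(s) in seconds: a fixed per-message cost plus
    the transmission time of size_bytes at `bandwidth` bytes/second."""
    return per_msg_overhead + size_bytes / bandwidth


def pipelined_broadcast_time(msg_bytes, seg_bytes, depth, latency, gap_fn=gap):
    """Estimated completion time of a segmented broadcast along a chain
    of `depth` hops (assumed model, not taken from the paper).

    The first segment needs depth * (latency + g(s)) to reach the last
    node; each of the remaining k - 1 segments arrives one gap later.
    All segments are costed at the full segment size, which slightly
    overestimates a shorter final segment.
    """
    k = math.ceil(msg_bytes / seg_bytes)
    g = gap_fn(seg_bytes)
    return depth * (latency + g) + (k - 1) * g


def best_segment_size(msg_bytes, depth, latency, candidates, gap_fn=gap):
    """Return the candidate segment size with the lowest estimated time."""
    return min(
        candidates,
        key=lambda s: pipelined_broadcast_time(msg_bytes, s, depth, latency, gap_fn),
    )


if __name__ == "__main__":
    sizes = [10_000, 100_000, 1_000_000]
    choice = best_segment_size(1_000_000, depth=3, latency=0.01, candidates=sizes)
    print("chosen segment size:", choice)
```

In this toy model, small segments pay the per-message overhead many times, while large segments lose the overlap between hops, so the estimated optimum lies in between for multi-hop trees and at the unsegmented message for a single hop.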
Keywords :
parallel programming; performance evaluation; wide area networks; bandwidth-efficient collective communication; clustered wide area systems; communication graph shapes; hierarchical network structure; latency; metacomputing infrastructures; performance model; Bandwidth; Broadcasting; Computer science; Concurrent computing; Delay; Libraries; Mathematics; Runtime
Conference_Title :
Parallel and Distributed Processing Symposium, 2000. IPDPS 2000. Proceedings. 14th International
Conference_Location :
Cancun
Print_ISBN :
0-7695-0574-0
DOI :
10.1109/IPDPS.2000.846026