Title :
Latency-Balanced Optimization of MPI Collective Communication across Multi-clusters
Author :
Xiaohui Wei ; Jun Cheng ; Hongliang Li
Author_Institution :
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
Abstract :
Recently, the demands of resource sharing and cooperation across multiple HPC centers are growing rapidly. Such multi-cluster HPC systems provide much more computing resources than single cluster. It challenges the scalability and performance of current parallel computing tools, especially the collective communication of parallel applications. This paper focuses on the optimization of collective communications of parallel applications across multiple HPC centers. A scalable latency-balanced broadcast algorithm (LB-B) for MPI across multi-cluster is proposed in this paper. It can dynamically adapt to the topology of the system and improve the communication performance.
Keywords :
application program interfaces; computer network performance evaluation; message passing; parallel algorithms; resource allocation; workstation clusters; MPI collective communication; communication performance improvement; latency-balanced optimization; message passing interface; multicluster HPC systems; multiple HPC centers; parallel computing tool performance; parallel computing tool scalability; resource cooperation; resource sharing; scalable LB-B algorithm; scalable latency-balanced broadcast algorithm; system topology; Algorithm design and analysis; Clustering algorithms; Conferences; Distributed processing; Network topology; Optimization; Topology; Collective Communication; MPI; Multi-cluster;
Conference_Titel :
ChinaGrid Annual Conference (ChinaGrid), 2013 8th
Conference_Location :
Changchun
Print_ISBN :
978-0-7695-5058-9
DOI :
10.1109/ChinaGrid.2013.26