DocumentCode
3403669
Title
Latency-Balanced Optimization of MPI Collective Communication across Multi-clusters
Author
Xiaohui Wei ; Jun Cheng ; Hongliang Li
Author_Institution
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
fYear
2013
fDate
22-23 Aug. 2013
Firstpage
9
Lastpage
13
Abstract
Demand for resource sharing and cooperation across multiple HPC centers has been growing rapidly. Such multi-cluster HPC systems provide far more computing resources than a single cluster, which challenges the scalability and performance of current parallel computing tools, especially the collective communication of parallel applications. This paper focuses on optimizing the collective communication of parallel applications across multiple HPC centers. It proposes a scalable latency-balanced broadcast algorithm (LB-B) for MPI across multi-cluster systems. The algorithm can dynamically adapt to the topology of the system and improve communication performance.
Keywords
application program interfaces; computer network performance evaluation; message passing; parallel algorithms; resource allocation; workstation clusters; MPI collective communication; communication performance improvement; latency-balanced optimization; message passing interface; multicluster HPC systems; multiple HPC centers; parallel computing tool performance; parallel computing tool scalability; resource cooperation; resource sharing; scalable LB-B algorithm; scalable latency-balanced broadcast algorithm; system topology; Algorithm design and analysis; Clustering algorithms; Conferences; Distributed processing; Network topology; Optimization; Topology; Collective Communication; MPI; Multi-cluster;
fLanguage
English
Publisher
IEEE
Conference_Titel
ChinaGrid Annual Conference (ChinaGrid), 2013 8th
Conference_Location
Changchun
Print_ISBN
978-0-7695-5058-9
Type
conf
DOI
10.1109/ChinaGrid.2013.26
Filename
6623859