Title :
High performance MPI library over SR-IOV enabled InfiniBand clusters
Author :
Jie Zhang ; Xiaoyi Lu ; Jithin Jose ; Mingzhe Li ; Rong Shi ; Dhabaleswar K. Panda
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
Virtualization has come to play a central role in HPC clouds because it offers easy management and low-cost computation and communication. Recently, Single Root I/O Virtualization (SR-IOV) technology has been introduced for high-performance interconnects such as InfiniBand, and it can attain near-native performance for inter-node communication. However, the SR-IOV scheme lacks locality-aware communication support, which leads to performance overheads for inter-VM communication within the same physical node. To address this issue, this paper first proposes a high-performance design of an MPI library over SR-IOV enabled InfiniBand clusters that dynamically detects VM locality and coordinates data movement between SR-IOV and Inter-VM Shared Memory (IVShmem) channels. With the proposed design, MPI applications running in virtualized mode can achieve efficient locality-aware communication on SR-IOV enabled InfiniBand clusters. In addition, we optimize communication in the IVShmem and SR-IOV channels by analyzing the performance impact of core mechanisms and parameters inside the MPI library, to deliver better performance in virtual machines. Finally, we conduct comprehensive performance studies using point-to-point and collective benchmarks and HPC applications. Experimental evaluations show that our proposed MPI library design can significantly improve the performance of point-to-point operations, collective operations, and MPI applications with different InfiniBand transport protocols (RC and UD) by up to 158%, 76%, and 43%, respectively, compared with SR-IOV. To the best of our knowledge, this is the first study to offer a high-performance MPI library that supports efficient locality-aware MPI communication over SR-IOV enabled InfiniBand clusters.
Keywords :
application program interfaces; cloud computing; message passing; parallel processing; shared memory systems; software libraries; transport protocols; virtual machines; workstation clusters; HPC cloud; IVShmem channels; InfiniBand transport protocols; RC; SR-IOV channel; SR-IOV enabled InfiniBand clusters; SR-IOV technology; UD; collective benchmark; collective operation; communication optimization; high performance MPI library; high performance design; high-performance interconnects; interVM communication; interVM shared memory channels; internode communication; locality-aware communication; performance impact analysis; point-to-point benchmark; point-to-point operation; single root I/O virtualization technology; virtualized mode; Bandwidth; Detectors; Kernel; Libraries; Performance evaluation; Virtualization; IVShmem; InfiniBand; MPI; SR-IOV
Conference_Title :
High Performance Computing (HiPC), 2014 21st International Conference on
Print_ISBN :
978-1-4799-5975-4
DOI :
10.1109/HiPC.2014.7116876