DocumentCode :
661796
Title :
Hardware Acceleration of Barrier Communication for Large Scale Parallel Computer
Author :
Pang Zhengbin ; Wang Shaogang ; Wu Dan ; Lu Pingjing
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
fYear :
2013
fDate :
14-16 Aug. 2013
Firstpage :
610
Lastpage :
614
Abstract :
MPI collective communication overhead dominates the communication cost for large scale parallel computers, scalability and operation latency for collective communication is critical for next generation computers. This paper proposes a fast and scalable barrier communication offload approach which supports millions of compute cores. Following our approach, the barrier operation sequence is packed by host MPI driver into the barrier “descriptor”, which is pushed to the NIC (Network-Interfaces). The NIC can complete the barrier automatically following its algorithm descriptor. Our approach accelerates both intra-node and inter-node barrier communication. We show that our approach achieves both barrier performance and scalability, especially for large scale computer system. This paper also proposes an extendable and easy-to-implement NIC architecture supporting barrier offload communication and also other communication pattern.
Keywords :
message passing; parallel processing; peripheral interfaces; MPI collective communication overhead; MPI driver; NIC architecture; barrier communication offload approach; hardware acceleration; internode barrier communication; intranode barrier communication; large scale computer system; large scale parallel computer; network-interfaces; next generation computers; operation latency; Computers; Delays; Engines; Hardware; Message systems; Radiation detectors; Software;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Networking in China (CHINACOM), 2013 8th International ICST Conference on
Conference_Location :
Guilin
Type :
conf
DOI :
10.1109/ChinaCom.2013.6694666
Filename :
6694666
Link To Document :
بازگشت