Title :
Scalable, high-performance NIC-based all-to-all broadcast over Myrinet/GM
Author :
Yu, Weikuan ; Panda, Dhabaleswar K. ; Buntinas, Darius
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
All-to-all broadcast is one of the common collective operations that involve dense communication between all processes in a parallel program. Previously, programmable network interface cards (NICs) have been leveraged to efficiently support collective operations, including barrier, broadcast, and reduce. This work explores the characteristics of all-to-all broadcast and proposes new algorithms to exploit the potential advantages of NIC programmability. Along with these algorithms, salient strategies have been used to provide scalable topology management, global buffer management, efficient communication processing, and message reliability. The algorithms have been incorporated into a NIC-based collective protocol over Myrinet/GM. The NIC-based all-to-all broadcast operations improve all-to-all broadcast bandwidth over 16 nodes by a factor of 3, compared to host-based all-to-all broadcast operation. Furthermore, the NIC-based operations have been demonstrated to achieve better scalability to large systems and very low host CPU utilization.
Keywords :
message passing; network interfaces; parallel programming; protocols; Myrinet/GM; NIC-based collective protocol; all-to-all broadcast bandwidth; communication processing; global buffer management; high-performance NIC-based all-to-all broadcast; message reliability; parallel programs; programmable network interface cards; scalable NIC-based all-to-all broadcast; topology management; Bandwidth; Broadcasting; Clustering algorithms; Computer networks; Computer science; Concurrent computing; Network interfaces; Protocols; Scalability; Topology;
Conference_Titel :
Cluster Computing, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8694-9
DOI :
10.1109/CLUSTR.2004.1392610