Title :
Accelerating List Management for MPI
Author :
Underwood, Keith D. ; Rodrigues, Arun ; Hemmert, K. Scott
Author_Institution :
Sandia Nat. Labs., Albuquerque, NM
Abstract :
The latency and throughput of MPI messages are critically important to a range of parallel scientific applications. In many modern networks, both of these performance characteristics are largely driven by the performance of a processor on the network interface. Because of the semantics of MPI, this embedded processor is forced to traverse a linked list of posted receives each time a message is received. As this list grows long, the latency of message reception grows and the throughput of MPI messages decreases. This paper presents a novel hardware feature to handle list management functions on a network interface. By moving functions such as list insertion, list traversal, and list deletion to the hardware unit, latencies are decreased by up to 20% in the zero-length queue case, with dramatic improvements in the presence of long queues. Similarly, the throughput is increased by up to 10% in the zero-length queue case and by nearly 100% in the presence of queues of 30 messages.
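The list traversal described in the abstract can be illustrated with a minimal software sketch. This is not the paper's hardware unit or any real MPI implementation: the names `PostedReceive`, `PostedReceiveQueue`, `post`, and `match`, and the `ANY_SOURCE` / `ANY_TAG` constants (stand-ins for `MPI_ANY_SOURCE` and `MPI_ANY_TAG`), are hypothetical and chosen only for illustration. The sketch assumes the usual MPI rule that an incoming message is matched against posted receives in posting order, and it counts how many entries are visited per match.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

ANY_SOURCE = -1  # hypothetical stand-in for MPI_ANY_SOURCE
ANY_TAG = -1     # hypothetical stand-in for MPI_ANY_TAG


@dataclass
class PostedReceive:
    """One entry in the posted-receive list (illustrative only)."""
    source: int
    tag: int
    comm: int
    next: Optional["PostedReceive"] = None


class PostedReceiveQueue:
    """Singly linked list of posted receives, kept in posting order."""

    def __init__(self) -> None:
        self.head: Optional[PostedReceive] = None
        self.tail: Optional[PostedReceive] = None

    def post(self, source: int, tag: int, comm: int) -> PostedReceive:
        """List insertion: append a new receive at the tail."""
        entry = PostedReceive(source, tag, comm)
        if self.tail is None:
            self.head = self.tail = entry
        else:
            self.tail.next = entry
            self.tail = entry
        return entry

    def match(self, source: int, tag: int,
              comm: int) -> Tuple[Optional[PostedReceive], int]:
        """List traversal and deletion for one incoming message.

        Returns the first matching entry (or None) and the number of
        entries visited, which is the cost that grows with list length.
        """
        prev: Optional[PostedReceive] = None
        node = self.head
        visited = 0
        while node is not None:
            visited += 1
            if (node.comm == comm
                    and node.source in (ANY_SOURCE, source)
                    and node.tag in (ANY_TAG, tag)):
                # Unlink the matched entry from the list.
                if prev is None:
                    self.head = node.next
                else:
                    prev.next = node.next
                if node is self.tail:
                    self.tail = prev
                node.next = None
                return node, visited
            prev = node
            node = node.next
        return None, visited
```

In this sketch, with 30 receives posted and an incoming message that matches only the last one, `match` visits all 30 entries before it can unlink the match. That linear walk is the per-message work the abstract proposes moving off the embedded processor.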
Keywords :
message passing; network interfaces; MPI messages; MPI semantics; embedded processor; hardware feature; list management acceleration; network interface; parallel scientific applications; Acceleration; Bandwidth; Delay; Hardware; Laboratories; Message passing; Network interfaces; Pipelines; Throughput; US Department of Energy;
Conference_Title :
Cluster Computing, 2005. IEEE International
Conference_Location :
Burlington, MA
Print_ISBN :
0-7803-9486-0
Electronic_ISBN :
1552-5244
DOI :
10.1109/CLUSTR.2005.347036