Title :
Fast barriers for scalable ccNUMA systems
Author :
Cheng, Liqun ; Carter, John B.
Author_Institution :
Sch. of Comput., Utah Univ., Salt Lake City, UT, USA
Abstract :
The contributions of this paper are threefold. First, we identify and quantify the performance deficiencies of conventional barrier implementations when they are executed on real (non-idealized) hardware. Second, we propose a queue-based barrier algorithm that has effectively O(1) time complexity as measured in round trip message latencies. Third, we demonstrate how matching the barrier implementation to the way that modern shared memory systems operate can improve performance dramatically by exploiting a hardware write-update (PUT) mechanism for signaling. The resulting barrier algorithm only costs one serialized round trip message latency to perform a barrier operation across N processors. Using a cycle-accurate execution-driven simulator of a future-generation SGI multiprocessor, we show that with no special hardware support our queue-based barrier outperforms OpenMP´s LL/SC-based barrier implementation by a factor of 7.9 on 256 processors. With hardware that supports a coherent PUT operation, our queue-based barrier outperforms OpenMP barriers by a factor of 94 and outperforms barriers based on SGI´s memory controller-based atomic operations by a factor of 6.5 on 256 processors.
Keywords :
computational complexity; message passing; microprogramming; performance evaluation; queueing theory; shared memory systems; PUT; cycle-accurate execution-driven simulator; future-generation SGI multiprocessor; hardware write-update mechanism; queue-based barrier algorithm; round trip message latency; scalable ccNUMA systems; shared memory multiprocessor systems; time complexity; Cities and towns; Computer networks; Costs; Delay; Hardware; Multiprocessing systems; Performance analysis; Scalability; Time measurement; Yarn;
Conference_Titel :
Parallel Processing, 2005. ICPP 2005. International Conference on
Print_ISBN :
0-7695-2380-3
DOI :
10.1109/ICPP.2005.39