DocumentCode
2309982
Title
Software Barrier Performance on Dual Quad-Core Opterons
Author
Chen, Jie ; Watson, William
Author_Institution
Sci. Comput. Group, Jefferson Lab., Newport News, VA
fYear
2008
fDate
12-14 June 2008
Firstpage
303
Lastpage
309
Abstract
Multi-core processors based SMP servers have become building blocks for Linux clusters in recent years because they can deliver better performance for multi-threaded programs through on-chip multi-threading. However, a relative slow software barrier can hinder the performance of a data-parallel scientific application on a multi-core system. In this paper we study the performance of different software barrier algorithms on a server based on newly introduced AMD quad-core Opteron processors. We study how the memory architecture and the cache coherence protocol of the system influence the performance of barrier algorithms. We present an optimized barrier algorithm derived from the queue-based barrier algorithm. We find that the optimized barrier algorithm achieves speedup of 1.77 over the original queue-based algorithm. In addition, it has speedup of 2.39 over the software barrier generated by the Intel OpenMP compiler.
Keywords
Linux; memory architecture; multi-threading; program compilers; Intel OpenMP compiler; Linux clusters; cache coherence protocol; data-parallel scientific application; dual quad-core Opterons; memory architecture; multicore processors; multithreaded programs; onchip multithreading; queue-based barrier algorithm; software barrier performance; Computer architecture; Hardware; High performance computing; Linux; Multicore processing; Protocols; Software algorithms; Software performance; Testing; Yarn; Centralized Barrier; OpenMP; Opteron; Queue-based Barrier; Software Barrier;
fLanguage
English
Publisher
ieee
Conference_Titel
Networking, Architecture, and Storage, 2008. NAS '08. International Conference on
Conference_Location
Chongqing
Print_ISBN
978-0-7695-3187-8
Type
conf
DOI
10.1109/NAS.2008.27
Filename
4579607
Link To Document