DocumentCode
2860971
Title
A Thread-Level Pipeline Parallel Model for CMP
Author
Huifang Guo ; Rongcai Zhao ; Kunpeng Jiang ; Jin Qu
Author_Institution
Nat. Digital Switching Syst. Eng. & Technol. Res. Center, Zhengzhou, China
fYear
2009
fDate
11-13 Dec. 2009
Firstpage
1
Lastpage
4
Abstract
Today, with the increasing popularity of chip multiprocessors (CMPs), the memory wall problem becomes more serious. So, making better use of the shared cache on chip is more necessary on CMP than other multiple processors architecture. In this paper, we analyze the performance of traditional special decomposed parallel implementation of red-black algorithm, and find that this parallel model does not exploit the temporary data locality of this application. Then, we restructure red-black algorithm to be a producer-consumer thread pipeline. Under this thread-level pipeline model, consumer threads can reuse the data that the former producers have fetched into the shared cache. Then the number of cache miss reduces. Our experiment results show the application performance under the thread-level pipeline parallel model achieves about 40% additional improvement on core 2. Furthermore, we propose a synchronization mechanism in hardware to support this model, and discuss the scalability of this parallel model.
Keywords
microprocessor chips; parallel architectures; chip multiprocessors; multiple processors architecture; producer-consumer thread pipeline; red-black algorithm; thread-level pipeline parallel model; Algorithm design and analysis; Delay; Parallel processing; Performance analysis; Pipelines; Programming profession; Scalability; Switching systems; Systems engineering and theory; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-4507-3
Electronic_ISBN
978-1-4244-4507-3
Type
conf
DOI
10.1109/CISE.2009.5366040
Filename
5366040
Link To Document