DocumentCode
2011278
Title
Software-Controlled Priority Characterization of POWER5 Processor
Author
Boneti, Carlos ; Cazorla, Francisco J. ; Gioiosa, Roberto ; Buyuktosunoglu, Alper ; Cher, Chen-Yong ; Valero, Mateo
Author_Institution
Univ. Politec. de Catalunya, Catalunya
fYear
2008
fDate
21-25 June 2008
Firstpage
415
Lastpage
426
Abstract
Due to the limitations of instruction-level parallelism, thread-level parallelism has become a popular way to improve processor performance. One example is the IBM POWER5TM processor, a two-context simultaneous-multithreaded dual-core chip. In each SMT core, the IBM POWER5 features two levels of thread resource balancing and prioritization. The first level provides automatic in-hardware resource balancing, while the second level is a software-controlled priority mechanism that presents eight levels of thread priorities. Currently, software-controlled prioritization is only used in limited number of cases in the software platforms due to lack of performance characterization of the effects of this mechanism. In this work, we characterize the effects of the software-based prioritization on several different workloads. We show that the impact of the prioritization significantly depends on the workloads coscheduled on a core. By prioritizing the right task, it is possible to obtain more than two times of throughput improvement for synthetic workloads compared to the baseline. We also present two application case studies targeting two different performance metrics: the first case study improves overall throughput by 23.7% and the second case study reduces the total execution time by 9.3%. In addition, we show the circumstances when a background thread can be run transparently without affecting the performance of the foreground thread.
Keywords
instruction sets; microprocessor chips; multi-threading; resource allocation; software performance evaluation; IBM POWER5 processor; instruction-level parallelism; performance metrics; software-controlled priority characterization; thread resource balancing; thread-level parallelism; two-context simultaneous-multithreaded dual-core chip; Application software; Computer architecture; Hardware; Linux; Parallel processing; Performance loss; Software performance; Surface-mount technology; Throughput; Yarn; IBM POWER5; SMT; performance characterization; simultaneous multithreading; software-controlled prioritization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Architecture, 2008. ISCA '08. 35th International Symposium on
Conference_Location
Beijing
ISSN
1063-6897
Print_ISBN
978-0-7695-3174-8
Type
conf
DOI
10.1109/ISCA.2008.8
Filename
4556744
Link To Document