Title :
Software-Controlled Priority Characterization of POWER5 Processor
Author :
Boneti, Carlos ; Cazorla, Francisco J. ; Gioiosa, Roberto ; Buyuktosunoglu, Alper ; Cher, Chen-Yong ; Valero, Mateo
Author_Institution :
Univ. Politec. de Catalunya, Catalunya
Abstract :
Due to the limitations of instruction-level parallelism, thread-level parallelism has become a popular way to improve processor performance. One example is the IBM POWER5TM processor, a two-context simultaneous-multithreaded dual-core chip. In each SMT core, the IBM POWER5 features two levels of thread resource balancing and prioritization. The first level provides automatic in-hardware resource balancing, while the second level is a software-controlled priority mechanism that presents eight levels of thread priorities. Currently, software-controlled prioritization is only used in limited number of cases in the software platforms due to lack of performance characterization of the effects of this mechanism. In this work, we characterize the effects of the software-based prioritization on several different workloads. We show that the impact of the prioritization significantly depends on the workloads coscheduled on a core. By prioritizing the right task, it is possible to obtain more than two times of throughput improvement for synthetic workloads compared to the baseline. We also present two application case studies targeting two different performance metrics: the first case study improves overall throughput by 23.7% and the second case study reduces the total execution time by 9.3%. In addition, we show the circumstances when a background thread can be run transparently without affecting the performance of the foreground thread.
Keywords :
instruction sets; microprocessor chips; multi-threading; resource allocation; software performance evaluation; IBM POWER5 processor; instruction-level parallelism; performance metrics; software-controlled priority characterization; thread resource balancing; thread-level parallelism; two-context simultaneous-multithreaded dual-core chip; Application software; Computer architecture; Hardware; Linux; Parallel processing; Performance loss; Software performance; Surface-mount technology; Throughput; Yarn; IBM POWER5; SMT; performance characterization; simultaneous multithreading; software-controlled prioritization;
Conference_Titel :
Computer Architecture, 2008. ISCA '08. 35th International Symposium on
Conference_Location :
Beijing
Print_ISBN :
978-0-7695-3174-8
DOI :
10.1109/ISCA.2008.8