DocumentCode :
2845655
Title :
MFLUSH: Handling Long-Latency Loads in SMT On-Chip Multiprocessors
Author :
Acosta, Carmelo ; Cazorla, Francisco J. ; Ramirez, Alex ; Valero, Mateo
Author_Institution :
HiPEAC Eur. Network of Excellence, Univ. Politec. de Catalunya, Barcelona
fYear :
2008
fDate :
9-12 Sept. 2008
Firstpage :
173
Lastpage :
181
Abstract :
There is a clear trend in industry towards using the growing number of on-chip transistors to replicate execution cores (CMP), where each core supports simultaneous multithreading (SMT). State-of-the-art high-performance processors such as the IBM POWER5 and POWER6 corroborate this CMP+SMT trend. Within each SMT core, any of the well-known SMT mechanisms may be applied to address SMT-related challenges. Among these challenges, probably the most important in an SMT execution pipeline concerns the instruction fetch (IFetch) policy. The FLUSH IFetch policy is one choice for throughput-oriented scenarios. It handles L2 cache misses so that no single execution thread monopolizes hardware resources, at the cost of additional energy spent refetching instructions. However, the new constraints imposed by the CMP+SMT scenario may affect well-known SMT mechanisms such as FLUSH. In this paper we revisit the FLUSH mechanism and analyze its application in the emerging CMP+SMT scenario. The analysis points out the new difficulties that FLUSH faces in this scenario. We then propose a novel IFetch policy designed to cope with the CMP+SMT scenario: MFLUSH. We also include a complete evaluation of the MFLUSH policy in terms of both throughput and energy consumption. Our results indicate that MFLUSH, specifically designed for the emerging CMP+SMT scenario, succeeds not only in overcoming the specific CMP+SMT constraints but also in allowing a 20% reduction in energy consumption without a significant loss of system throughput.
Keywords :
cache storage; multi-threading; multiprocessing systems; storage management; FLUSH mechanism; IBM POWER5; IBM POWER6; MFLUSH; SMT execution pipeline; SMT on-chip multiprocessors; high-performance processors; instruction fetch policy; long-latency load; replicating execution cores; simultaneous multithreading; Delay; Energy consumption; Multithreading; Network-on-a-chip; Parallel processing; Pipelines; Surface-mount technology; Switches; Throughput; Yarn; CMP; Instruction Fetch Policy; SMT;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel Processing, 2008. ICPP '08. 37th International Conference on
Conference_Location :
Portland, OR
ISSN :
0190-3918
Print_ISBN :
978-0-7695-3374-2
Electronic_ISBN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2008.48
Filename :
4625847