DocumentCode :
1350626
Title :
Lightweight Chip Multi-Threading (LCMT): Maximizing Fine-Grained Parallelism On-Chip
Author :
Li, Sheng ; Kuntz, Shannon ; Brockman, Jay ; Kogge, Peter
Author_Institution :
Dept. of Electr. Eng., Univ. of Notre Dame, Notre Dame, IN, USA
Volume :
22
Issue :
7
fYear :
2011
fDate :
7/1/2011 12:00:00 AM
Firstpage :
1178
Lastpage :
1191
Abstract :
Irregular and dynamic applications, such as graph problems and agent-based simulations, often require fine-grained parallelism to achieve good performance. However, current multicore processors only provide architectural support for coarse-grained parallelism, making it necessary to use software-based multithreading environments to effectively implement fine-grained parallelism. Although these software-based environments have demonstrated superior performance over heavyweight, OS-level threads, they are still limited by the significant overhead involved in thread management and synchronization. In order to address this, we propose a Lightweight Chip Multi-Threaded (LCMT) architecture that further exploits thread-level parallelism (TLP) by incorporating direct architectural support for an “unlimited” number of dynamically created lightweight threads with very low thread management and synchronization overhead. The LCMT architecture can be implemented atop a mainstream architecture with minimum extra hardware to leverage existing legacy software environments. We compare the LCMT architecture with a Niagara-like baseline architecture. Our results show up to 1.8X better scalability, 1.91X better performance, and more importantly, 1.74X better performance per watt, using the LCMT architecture for irregular and dynamic benchmarks, when compared to the baseline architecture. The LCMT architecture delivers similar performance to the baseline architecture for regular benchmarks.
Keywords :
microprocessor chips; multi-threading; multiprocessing systems; parallel architectures; Niagara like baseline architecture; OS level threads; agent based simulations; coarse grained parallelism; fine grained parallelism on-chip; graph problems; legacy software environments; lightweight chip multithreading; multicore processors; software based multithreading environments; thread level parallelism; Computer architecture; Hardware; Instruction sets; Medical services; Parallel processing; Registers; Multithreaded processors; irregular applications.; multicore processors; unlimited multithreading;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2010.169
Filename :
5601707
Link To Document :
بازگشت