DocumentCode :
572342
Title :
Scalable power control for many-core architectures running multi-threaded applications
Author :
Ma, Kai ; Li, Xue ; Chen, Ming ; Wang, Xiaorui
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of Tennessee, Knoxville, TN, USA
fYear :
2011
fDate :
4-8 June 2011
Firstpage :
449
Lastpage :
460
Abstract :
Optimizing the performance of a multi-core microprocessor within a power budget has recently received a lot of attention. However, most existing solutions are centralized and cannot scale well with the rapidly increasing level of core integration. While a few recent studies propose power control algorithms for many-core architectures, those solutions assume that the workload of every core is independent and therefore cannot effectively allocate power based on thread criticality to accelerate multi-threaded parallel applications, which are expected to be the primary workloads of many-core architectures. This paper presents a scalable power control solution for many-core microprocessors that is specifically designed to handle realistic workloads, i.e., a mixed group of single-threaded and multi-threaded applications. Our solution features a three-layer design. First, we adopt control theory to precisely control the power of the entire chip to its chip-level budget by adjusting the aggregated frequency of all the cores on the chip. Second, we dynamically group cores running the same applications and then partition the chip-level aggregated frequency quota among different groups for optimized overall microprocessor performance. Finally, we partition the group-level frequency quota among the cores in each group based on the measured thread criticality for shorter application completion time. As a result, our solution can optimize the microprocessor performance while precisely limiting the chip-level power consumption below the desired budget. Empirical results on a 12-core hardware testbed show that our control solution can provide precise power control, as well as 17% and 11% better application performance than two state-of-the-art solutions, on average, for mixed PARSEC and SPEC benchmarks. Furthermore, our extensive simulation results for 32, 64, and 128 cores, as well as overhead analysis for up to 4,096 cores, demonstrate that our solution is highly scalable to man- -core architectures.
Keywords :
microprocessor chips; multi-threading; parallel architectures; power aware computing; power control; PARSEC benchmark; SPEC benchmark; application completion time; chip power control; chip-level aggregated frequency partitioning; chip-level budget; chip-level power consumption; control theory; core integration; group-level frequency partitioning; many-core architecture; many-core microprocessor; microprocessor performance optimization; multicore microprocessor; multithreaded parallel applications; power budget; power control algorithms; scalable power control; scalable power control solution; single-threaded applications; three-layer design; Frequency control; Instruction sets; Microprocessors; Multicore processing; Power control; Power demand; Semiconductor device measurement; Chip multiprocessor; control theory; many-core architecture; power capping; power control; scalability; thread criticality;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture (ISCA), 2011 38th Annual International Symposium on
Conference_Location :
San Jose, CA
ISSN :
1063-6897
Print_ISBN :
978-1-4503-0472-6
Type :
conf
Filename :
6307635
Link To Document :
بازگشت