DocumentCode :
2680820
Title :
Identifying the optimal energy-efficient operating points of parallel workloads
Author :
Cochran, Ryan ; Hankendi, Can ; Coskun, Ayse ; Reda, Sherief
Author_Institution :
Sch. of Eng., Brown Univ., Providence, RI, USA
fYear :
2011
fDate :
7-10 Nov. 2011
Firstpage :
608
Lastpage :
615
Abstract :
As the number of cores per processor grows, there is a strong incentive to develop parallel workloads to take advantage of the hardware parallelism. In comparison to single-threaded applications, parallel workloads are more complex to characterize due to thread interactions and resource stalls. This paper presents an accurate and scalable method for determining the optimal system operating points (i.e., number of threads and DVFS settings) at runtime for parallel workloads under a set of objective functions and constraints that optimize for energy efficiency in multi-core processors. Using an extensive training data set gathered for a wide range of parallel workloads on a commercial multi-core system, we construct multinomial logistic regression (MLR) models that estimate the optimal system settings as a function of workload characteristics. We use L1-regularization to automatically determine the relevant workload metrics for energy optimization. At runtime, our technique determines the optimal number of threads and the DVFS setting with negligible overhead. Our experiments demonstrate that our method outperforms prior techniques with up to 51% improved decision accuracy. This translates to up to 10.6% average improvement in energy-performance operation, with a maximum improvement of 30.9%. Our technique also demonstrates superior scalability as the number of potential system operating points increases.
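The modeling idea in the abstract, an L1-regularized multinomial logistic regression that maps workload metrics to an operating point, can be illustrated with a minimal sketch. This is not the authors' implementation: the operating points, the two informative metrics, the noise metric, the synthetic data, and the proximal-gradient training loop are all assumptions chosen only to show the technique in a self-contained form.

```python
import math
import random

# Hypothetical operating points: (thread count, frequency in GHz).
OPERATING_POINTS = [(1, 2.6), (4, 2.6), (4, 1.6)]


def softmax(scores):
    """Numerically stable softmax over a list of class scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_threshold(w, t):
    """Proximal operator of the L1 penalty: shrink w toward zero by t."""
    if w > t:
        return w - t
    if w < -t:
        return w + t
    return 0.0


def class_scores(W, b, x):
    """Linear score of each class for feature vector x."""
    return [sum(wj * xj for wj, xj in zip(W[k], x)) + b[k] for k in range(len(W))]


def train_mlr_l1(X, y, n_classes, lam=0.05, lr=0.5, epochs=300):
    """Fit multinomial logistic regression with an L1 penalty on the weights
    using proximal gradient descent (a gradient step, then soft-thresholding).
    The biases are left unpenalized."""
    n, d = len(X), len(X[0])
    W = [[0.0] * d for _ in range(n_classes)]
    b = [0.0] * n_classes
    for _ in range(epochs):
        gW = [[0.0] * d for _ in range(n_classes)]
        gb = [0.0] * n_classes
        for x, label in zip(X, y):
            p = softmax(class_scores(W, b, x))
            for k in range(n_classes):
                err = p[k] - (1.0 if k == label else 0.0)
                gb[k] += err / n
                for j in range(d):
                    gW[k][j] += err * x[j] / n
        for k in range(n_classes):
            b[k] -= lr * gb[k]
            for j in range(d):
                W[k][j] = soft_threshold(W[k][j] - lr * gW[k][j], lr * lam)
    return W, b


def predict(W, b, x):
    """Return the index of the highest-scoring operating point."""
    scores = class_scores(W, b, x)
    return max(range(len(scores)), key=scores.__getitem__)


def make_synthetic_data(n, seed=0):
    """Synthetic stand-in for training data (an assumption, not measured data).
    Features 0 and 1 are informative metrics; feature 2 is pure noise."""
    rng = random.Random(seed)
    centers = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
    X, y = [], []
    for i in range(n):
        label = i % len(centers)
        cx, cy = centers[label]
        X.append([rng.gauss(cx, 0.15), rng.gauss(cy, 0.15), rng.uniform(-0.5, 0.5)])
        y.append(label)
    return X, y
```

A call such as `W, b = train_mlr_l1(*make_synthetic_data(150), n_classes=3)` followed by `OPERATING_POINTS[predict(W, b, metrics)]` would return the selected setting for a new metric vector. The L1 penalty tends to shrink the weights of the uninformative metric toward zero, which is the feature-selection effect the abstract attributes to L1-regularization.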
Keywords :
multiprocessing systems; power aware computing; regression analysis; DVFS settings; L1-regularization; energy optimization; hardware parallelism; multicore processors; multinomial logistic regression models; objective functions; optimal energy-efficient operating point identification; optimal system operating points; parallel workloads; resource stalls; thread interactions; Accuracy; Delay; Logistics; Mathematical model; Runtime; Training data;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer-Aided Design (ICCAD), 2011 IEEE/ACM International Conference on
Conference_Location :
San Jose, CA
ISSN :
1092-3152
Print_ISBN :
978-1-4577-1399-6
Electronic_ISBN :
1092-3152
Type :
conf
DOI :
10.1109/ICCAD.2011.6105393
Filename :
6105393