DocumentCode :
634806
Title :
Locality-aware power optimization and measurement methodology for PGAS workloads on SMP clusters
Author :
Newsom, David K. ; Azari, Sardar F. ; Anbar, Ahmad ; El-Ghazawi, Tarek
Author_Institution :
Dept. of Electr. & Comput. Eng., George Washington Univ., Washington, DC, USA
fYear :
2013
fDate :
27-29 June 2013
Firstpage :
1
Lastpage :
10
Abstract :
Reducing energy consumption without affecting computational performance is a significant research driver in computer engineering. The Partitioned Global Address Space (PGAS) programming model provides a global address space for ease-of-use while providing locality-awareness for efficient execution. For symmetric multiprocessor (SMP) clusters, PGAS locality-awareness offers opportunities for intelligent energy management due to the inherent latencies in the interconnection network fabric. In this paper, the second in a series that explores PGAS power optimization, we present a detailed examination of how data locality-awareness can be exploited to improve the energy consumption of workload sharing on SMP clusters. We present a systematic methodology for analyzing a given algorithm´s potential for locality-aware power optimization. We show how to automate the code modifications for an iterative stencil loop algorithm using a profiler that analyzes the algorithms´ non-local access patterns and optimizes the code for thread-local data pre-fetch. We explore the use of dynamic voltage frequency scaling (DVFS) in conjunction with time-based optimizations and provide an extensible and precise power measurement framework for generating the experimental results. We compare optimized and CPU-default approaches to DVFS application at the program-execution and processor architecture level. We show how the latency of the cluster´s interconnection network can impact the opportunities for power optimization by comparing the programs´ energy/time behavior when the cluster nodes are connected by Ethernet or Infiniband network fabrics. For our representative iterative stencil loop algorithm, locality aware power optimizations show significant improvements in energy efficiency over native code running in default processor power modes.
Keywords :
multiprocessing systems; power aware computing; storage management; CPU-default approach; DVFS; Ethernet; Infiniband network fabrics; PGAS locality-awareness; PGAS power optimization; PGAS programming model; PGAS workloads; SMP clusters; code modification automation; code optimization; computer engineering; data locality-awareness; default processor power modes; dynamic voltage frequency scaling; energy consumption reduction; energy efficiency; intelligent energy management; interconnection network fabric; locality aware power optimizations; locality-aware power optimization; measurement methodology; native code; nonlocal access patterns; partitioned global address space programming model; power measurement framework; processor architecture level; profiler; program-execution; representative iterative stencil loop algorithm; symmetric multiprocessor cluster; thread-local data prefetch; time-based optimizations; workload sharing; Arrays; Electronics packaging; Instruction sets; Optimization; Programming; Runtime; DVFS; PGAS; SMP; UPC; data locality-awareness; energy management; energy measurement; parallel programming;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Green Computing Conference (IGCC), 2013 International
Conference_Location :
Arlington, VA
Type :
conf
DOI :
10.1109/IGCC.2013.6604508
Filename :
6604508
Link To Document :
بازگشت