DocumentCode :
1997010
Title :
Analyzing Optimization Techniques for Power Efficiency on Heterogeneous Platforms
Author :
Ukidave, Yash ; Kaeli, David R.
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
1040
Lastpage :
1049
Abstract :
Graphics processing units (GPUs) have become widely accepted as the computing platform of choice in many high performance computing domains. The availability of programming standards such as OpenCL are used to leverage the inherent parallelism offered by GPUs. Source code optimizations such as loop unrolling and tiling when targeted to heterogeneous applications have reported large gains in performance. However, given the power consumption of GPUs, platforms can exhaust their power budgets quickly. Better solutions are needed to effectively exploit the power-efficiency available on heterogeneous systems. In this work, we evaluate the power/performance efficiency of different optimizations used on heterogeneous applications. We analyze the power/performance trade-off by evaluating energy consumption of the optimizations. We compare the performance of different optimization techniques on 4 different Fast Fourier Transform implementations. Our study covers discrete GPUs and shared memory GPUs (APUs), and includes hardware from AMD (Llano APUs and the Southern Islands GPU), Nvidia (Kepler) and Intel (Ivy Bridge) as test platforms. The study identifies the architectural and algorithmic factors which can most impact power consumption. We explore arange of application optimizations which show an increase in power consumption by 27%, but result in more than a 1.8Xspeedup in performance. We observe a 11% variation in energy consumption among different optimizations. We highlight how different optimizations can improve the execution performance of a heterogeneous application, but also impact power efficiency of the application.
Keywords :
fast Fourier transforms; graphics processing units; power aware computing; power consumption; AMD; Intel; Nvidia; OpenCL programming standards; algorithmic factors; architectural factors; discrete GPUs; energy consumption; fast Fourier transform; graphics processing units; heterogeneous platforms; heterogeneous systems; high performance computing domains; power budgets; power consumption; power efficiency; shared memory GPUs; source code optimization techniques; test platforms; Discrete Fourier transforms; Graphics processing units; Kernel; Optimization; Performance evaluation; Power demand; Power measurement; FFT; OpenCL; Optimizations; Power;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
Type :
conf
DOI :
10.1109/IPDPSW.2013.220
Filename :
6650989
Link To Document :
بازگشت