Title :
GPGPU workload characteristics and performance analysis
Author :
Lal, Sunil ; Lucas, Jerome ; Andersch, Michael ; Alvarez-Mesa, Mauricio ; Elhossini, Ahmed ; Juurlink, Ben
Author_Institution :
Embedded Syst. Archit., Tech. Univ. Berlin, Berlin, Germany
Abstract :
GPUs are much more power-efficient than CPUs, but due to several performance bottlenecks, the performance per watt they achieve is often much lower than the theoretical maximum. To sustain the growth of high-performance computing, new architectural and application techniques are required to build power-efficient computing systems. Finding such techniques, however, requires studying power consumption at a detailed level and understanding the bottlenecks that cause low performance. In this paper, we therefore study GPU power consumption at the component level and investigate the bottlenecks that cause low performance and low energy efficiency. We divide low-performance kernels into low-occupancy and full-occupancy categories. For the low-occupancy category, we study whether increasing the occupancy improves performance and energy efficiency. For the full-occupancy category, we investigate whether the kernels are limited by memory bandwidth, coalescing efficiency, or SIMD utilization.
Keywords :
graphics processing units; parallel processing; power aware computing; CPU; GPGPU workload characteristics; GPU power consumption; SIMD utilization; coalescing efficiency; component level; full occupancy category; high performance computing growth; low energy efficiency; low occupancy; low performance kernels; memory bandwidth; performance analysis; power-efficient computing system; power-efficient devices; Benchmark testing; Correlation; Graphics processing units; Instruction sets; Kernel; Measurement; Power demand
Conference_Title :
Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XIV), 2014 International Conference on
Conference_Location :
Agios Konstantinos
DOI :
10.1109/SAMOS.2014.6893202