Title :
Energy-efficient GPGPU architectures via collaborative compilation and memristive memory-based computing
Author :
Rahimi, Azar ; Ghofrani, Amirali ; Lastras-Montano, Miguel Angel ; Kwang-Ting Cheng ; Benini, Luca ; Gupta, R.K.
Author_Institution :
Dept. of CSE, UC San Diego, La Jolla, CA, USA
Abstract :
Thousands of deep and wide pipelines working concurrently make GPGPU high power consuming parts. Energy-efficiency techniques employ voltage overscaling that increases timing sensitivity to variations and hence aggravating the energy use issues. This paper proposes a method to increase spatiotemporal reuse of computational effort by a combination of compilation and micro-architectural design. An associative memristive memory (AMM) module is integrated with the floating point units (FPUs). Together, we enable fine-grained partitioning of values and find high-frequency sets of values for the FPUs by searching the space of possible inputs, with the help of application-specific profile feedback. For every kernel execution, the compiler pre-stores these high-frequent sets of values in AMM modules - representing partial functionality of the associated FPU- that are concurrently evaluated over two clock cycles. Our simulation results show high hit rates with 32-entry AMM modules that enable 36% reduction in average energy use by the kernel codes. Compared to voltage overscaling, this technique enhances robustness against timing errors with 39% average energy saving.
Keywords :
energy conservation; graphics processing units; power aware computing; program compilers; AMM module; FPU; application-specific profile feedback; associative memristive memory module; collaborative compilation; energy saving; energy use; energy-efficiency techniques; energy-efficient GPGPU architecture; floating point units; general purpose graphics processing unit; kernel execution; memristive memory-based computing; microarchitectural design; spatiotemporal reuse; timing sensitivity; voltage overscaling; Clocks; Collaboration; Computer architecture; Kernel; Memristors; Pipelines; Timing; Energy efficiency; GPGPUs; compiler; memory-based computing; memristor; timing errors; variations;
Conference_Titel :
Design Automation Conference (DAC), 2014 51st ACM/EDAC/IEEE
Conference_Location :
San Francisco, CA