DocumentCode :
704074
Title :
Approximate associative memristive memory for energy-efficient GPUs
Author :
Rahimi, Abbas ; Ghofrani, Amirali ; Kwang-Ting Cheng ; Benini, Luca ; Gupta, Rajesh K.
Author_Institution :
CSE, UC San Diego, La Jolla, CA, USA
fYear :
2015
fDate :
9-13 March 2015
Firstpage :
1497
Lastpage :
1502
Abstract :
Multimedia applications running on thousands of deep and wide pipelines working concurrently in GPUs have been an important target for power minimization both at the architectural and algorithmic levels. At the hardware level, energy-efficiency techniques that employ voltage overscaling face a barrier so-called “path walls”: reducing operating voltage beyond a certain point generates massive number of timing errors that are impractical to tolerate. We propose an architectural innovation, called A2M2 module (approximate associative memristive memory) that exhibits few tolerable timing errors suitable for GPU applications under voltage overscaling. A2M2 is integrated with every floating point unit (FPU), and performs partial functionality of the associated FPU by pre-storing high frequency patterns for computational reuse that avoids overhead due to re-execution. Voltage overscaled A2M2 is designed to match an input search pattern with any of the stored patterns within a Hamming distance range of 0-2. This matching behavior under voltage overscaling leads to a controllable approximate computing for multimedia applications. Our experimental results for the AMD Southern Islands GPU show that four image processing kernels tolerate the mismatches during pattern matching resulting in a PSNR ≥ 30dB. The A2M2 module with 8-row enables 28% voltage overscaling in 45nm technology resulting in 32% average energy saving for the kernels, while delivering an acceptable quality of service.
Keywords :
content-addressable storage; energy conservation; graphics processing units; image matching; multimedia systems; power aware computing; quality of service; A2M2 module; AMD Southern Islands GPU; FPU; Hamming distance; algorithmic level; approximate associative memristive memory; architectural innovation; architectural level; computational reuse; energy-efficient GPU; floating point unit; high frequency patterns; image processing kernels; multimedia applications; path walls; pattern matching; power minimization; quality of service; timing errors; voltage overscaling; Approximation methods; Graphics processing units; Image processing; Kernel; Memristors; PSNR; Pattern matching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015
Conference_Location :
Grenoble
Print_ISBN :
978-3-9815-3704-8
Type :
conf
Filename :
7092626
Link To Document :
بازگشت