DocumentCode
3602171
Title
Toggle-Aware Compression for GPUs
Author
Pekhimenko, Gennady ; Bolotin, Evgeny ; O´Connor, Mike ; Mutlu, Onur ; Mowry, Todd C. ; Keckler, Stephen W.
Author_Institution
Dept. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
14
Issue
2
fYear
2015
Firstpage
164
Lastpage
168
Abstract
Memory bandwidth compression can be an effective way to achieve higher system performance and energy efficiency in modern data-intensive applications by exploiting redundancy in data. Prior works studied various data compression techniques to improve both capacity (e.g., of caches and main memory) and bandwidth utilization (e.g., of the on-chip and off-chip interconnects). These works addressed two common shortcomings of compression: (i) compression/decompression overhead in terms of latency, energy, and area, and (ii) hardware complexity to support variable data size. In this paper, we make the new observation that there is another important problem related to data compression in the context of the communication energy efficiency: transferring compressed data leads to a substantial increase in the number of bit toggles (communication channel switchings from 0 to 1 or from 1 to 0). This, in turn, increases the dynamic energy consumed by on-chip and off-chip buses due to more frequent charging and discharging of the wires. Our results, for example, show that the bit toggle count increases by an average of 2.2× with some compression algorithms across 54 mobile GPU applications. We characterize and demonstrate this new problem across a wide variety of 221 GPU applications and six different compression algorithms. To mitigate the problem, we propose two new toggle-aware compression techniques: energy control and Metadata Consolidation. These techniques greatly reduce the bit toggle count impact of the six data compression algorithms we examine, while keeping most of their bandwidth reduction benefits.
Keywords
data compression; graphics processing units; power aware computing; bandwidth utilization; bit toggle count impact; bit toggles; communication energy efficiency; compression/decompression overhead; data compression algorithms; data compression techniques; data redundancy; dynamic energy; energy control; hardware complexity; memory bandwidth compression; metadata consolidation; mobile GPU applications; modern data-intensive applications; off-chip buses; on-chip buses; toggle-aware compression; variable data size; Communication channels; Compression algorithms; Data compression; Graphics processing units; Mobile communication; System-on-chip; Data compression, interconnected systems, memory;
fLanguage
English
Journal_Title
Computer Architecture Letters
Publisher
ieee
ISSN
1556-6056
Type
jour
DOI
10.1109/LCA.2015.2430853
Filename
7103282
Link To Document