DocumentCode
3663924
Title
Data reorganization in memory using 3D-stacked DRAM
Author
Berkin Akin;Franz Franchetti;James C. Hoe
Author_Institution
Carnegie Mellon University, USA
fYear
2015
fDate
6/1/2015 12:00:00 AM
Firstpage
131
Lastpage
143
Abstract
In this paper we focus on common data reorganization operations such as shuffle, pack/unpack, swap, transpose, and layout transformations. Although these operations simply relocate the data in the memory, they are costly on conventional systems mainly due to inefficient access patterns, limited data reuse and roundtrip data traversal throughout the memory hierarchy. This paper presents a two pronged approach for efficient data reorganization, which combines (i) a proposed DRAM-aware reshape accelerator integrated within 3D-stacked DRAM, and (ii) a mathematical framework that is used to represent and optimize the reorganization operations. We evaluate our proposed system through two major use cases. First, we demonstrate the reshape accelerator in performing a physical address remapping via data layout transform to utilize the internal parallelism/locality of the 3D-stacked DRAM structure more efficiently for general purpose workloads. Then, we focus on offloading and accelerating commonly used data reorganization routines selected from the Intel Math Kernel Library package. We evaluate the energy and performance benefits of our approach by comparing it against existing optimized implementations on state-of-the-art GPUs and CPUs. For the various test cases, in-memory data reorganization provides orders of magnitude performance and energy efficiency improvements via low overhead hardware.
Keywords
"Random access memory","Silicon","Hardware","Legged locomotion","Semantics","Filling"
Publisher
ieee
Conference_Titel
Computer Architecture (ISCA), 2015 ACM/IEEE 42nd Annual International Symposium on
Type
conf
DOI
10.1145/2749469.2750397
Filename
7284061
Link To Document