Row-buffer decoupling: A case for low-latency DRAM microarchitecture

Author

Seongil, O. ; Young Hoon Son ; Nam Sung Kim ; Jung Ho Ahn

Author_Institution

Seoul Nat. Univ., Seoul, South Korea

fYear

2014

fDate

14-18 June 2014

Firstpage

337

Lastpage

348

Abstract

Modern DRAM devices for the main memory are structured to have multiple banks to satisfy ever-increasing throughput, energy-efficiency, and capacity demands. Due to tight cost constraints, only one row can be buffered (opened) per bank and actively service requests at a time, while the row must be deactivated (closed) before a new row is stored into the row buffers. Hasty deactivation unnecessarily re-opens rows for otherwise row-buffer hits while hindsight accompanies the deactivation process on the critical path of accessing data for row-buffer misses. The time to (de)activate a row is comparable to the time to read an open row while applications are often sensitive to DRAM latency. Hence, it is critical to make the right decision on when to close a row. However, the increasing number of banks per DRAM device over generations reduces the number of requests per bank. This forces a memory controller to frequently predict when to close a row due to a lack of information on future requests, while the dynamic nature of memory access patterns limits the prediction accuracy. In this paper, we propose a novel DRAM microarchitecture that can eliminate the need for any prediction. First, we identify that precharging the bitlines dominates the deactivate time, while sense amplifiers that work as a row buffer are physically coupled with the bitlines such that a single command precharges both bitlines and sense amplifiers simultaneously. By decoupling the bitlines from the row buffers using isolation transistors, the bitlines can be precharged right after a row becomes activated. Therefore, only the sense amplifiers need to be precharged for a miss in most cases, taking an order of magnitude shorter time than the conventional deactivation process. Second, we show that this row-buffer decoupling enables internal DRAM μ-operations to be separated and recombined, which can be exploited by memory controllers to make the main memory system more energy efficient. Our experi- ents demonstrate that row-buffer decoupling improves the geometric mean of the instructions per cycle and MIPS²/W by 14% and 29%, respectively, for memory-intensive SPEC CPU2006 applications.

Keywords

DRAM chips; buffer storage; DRAM latency; deactivation process; energy efficiency; isolation transistors; low-latency DRAM microarchitecture; memory access patterns; memory controller; memory system; memory-intensive SPEC CPU2006 applications; modern DRAM devices; row-buffer decoupling; row-buffer misses; Buffer storage; Capacitance; Memory management; Microarchitecture; Random access memory; Transistors;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Architecture (ISCA), 2014 ACM/IEEE 41st International Symposium on

Conference_Location

Minneapolis, MN

Print_ISBN

978-1-4799-4396-8

Type

conf

DOI

10.1109/ISCA.2014.6853230

Filename

6853230

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=177348