• DocumentCode
    602637
  • Title

    Tiered-latency DRAM: A low latency and low cost DRAM architecture

  • Author

    Donghyuk Lee ; Yoongu Kim ; Seshadri, Vivek ; Liu, Jiangchuan ; Subramanian, Lavanya ; Mutlu, Onur

  • Author_Institution
    Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2013
  • fDate
    23-27 Feb. 2013
  • Firstpage
    615
  • Lastpage
    626
  • Abstract
    The capacity and cost-per-bit of DRAM have historically scaled to satisfy the needs of increasingly large and complex computer systems. However, DRAM latency has remained almost constant, making memory latency the performance bottleneck in today´s systems. We observe that the high access latency is not intrinsic to DRAM, but a trade-off made to decrease cost-per-bit. To mitigate the high area overhead of DRAM sensing structures, commodity DRAMs connect many DRAM cells to each sense-amplifier through a wire called a bitline. These bitlines have a high parasitic capacitance due to their long length, and this bitline capacitance is the dominant source of DRAM latency. Specialized low-latency DRAMs use shorter bitlines with fewer cells, but have a higher cost-per-bit due to greater sense-amplifier area overhead. In this work, we introduce Tiered-Latency DRAM (TL-DRAM), which achieves both low latency and low cost-per-bit. In TL-DRAM, each long bitline is split into two shorter segments by an isolation transistor, allowing one segment to be accessed with the latency of a short-bitline DRAM without incurring high cost-per-bit. We propose mechanisms that use the low-latency segment as a hardware-managed or software-managed cache. Evaluations show that our proposed mechanisms improve both performance and energy-efficiency for both single-core and multi-programmed workloads.
  • Keywords
    DRAM chips; cache storage; energy conservation; memory architecture; multiprogramming; performance evaluation; power aware computing; transistors; DRAM capacity; DRAM cost-per-bit; DRAM sensing structures; TL-DRAM; bitline; complex computer systems; energy-efficiency; hardware-managed cache; high parasitic capacitance; isolation transistor; low-latency low-cost DRAM architecture; memory latency; multiprogrammed workloads; single-core workloads; software-managed cache; tiered-latency DRAM; Capacitance; Capacitors; Computer architecture; DRAM chips; Timing; Transistors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computer Architecture (HPCA2013), 2013 IEEE 19th International Symposium on
  • Conference_Location
    Shenzhen
  • ISSN
    1530-0897
  • Print_ISBN
    978-1-4673-5585-8
  • Type

    conf

  • DOI
    10.1109/HPCA.2013.6522354
  • Filename
    6522354