• DocumentCode
    2958770
  • Title

    Miss-Correlation Folding: Encoding Per-Block Miss Correlations in Compressed DRAM for Data Prefetching

  • Author

    Gang Liu ; Jih-Kwon Peir ; Lee, Victor

  • Author_Institution
    Dept. of Comput. & Inf. Sci. & Eng, Univ. of Florida, Gainesville, FL, USA
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    691
  • Lastpage
    702
  • Abstract
    Cache misses frequently exhibit repeated streaming behavior, i.e. a sequence of cache misses has a high tendency of being repeated. Correlation-based prefetchers record the missing streams in a history table for accurate prefetching. Saving a large miss history in off-chip DRAM is a practical implementation, but incurs access latency and consumes memory bandwidth which leads to performance degradation. In this paper, we investigate a new data prefetching mechanism based on per-block miss correlation where a miss is correlated with an earlier miss when the two misses are closely encountered both in time and space. The miss correlations are captured dynamically and saved along with the content of the data block using a simple data compression technique. As a result of this novel combination, our scheme provides unbounded correlation history and its prefetch metadata can be fetched together with demand data without incurring additional latency nor consuming any memory bandwidth. Performance evaluations using data-parallel applications demonstrate that prefetchers based on per-block miss correlations can improve IPC by 42-139% with an average of 88% compared to the IPC without prefetching. In comparison with regular stream prefetcher, sampled temporal streaming prefetcher and spatial-temporal memory streaming prefetcher, up to 115%, 99% and 98% IPC improvement can be obtained with an average about 36%, 26% and 27% respectively.
  • Keywords
    DRAM chips; cache storage; data compression; encoding; meta data; parallel processing; IPC improvement; access latency; compressed DRAM; correlation-based prefetchers; data compression technique; data prefetching; data-parallel applications; demand data; memory bandwidth; miss-correlation folding; missing streams; off-chip DRAM; per-block miss correlations encoding; performance degradation; prefetch metadata; regular stream prefetcher; repeated streaming behavior; sampled temporal streaming prefetcher; spatial-temporal memory streaming prefetcher; unbounded correlation history; Bandwidth; Correlation; Encoding; History; Memory management; Prefetching; Random access memory; cache; compress; data parallel; miss correlation; prefetch; spatial; temporal;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-0975-2
  • Type

    conf

  • DOI
    10.1109/IPDPS.2012.68
  • Filename
    6267870