DocumentCode
2958770
Title
Miss-Correlation Folding: Encoding Per-Block Miss Correlations in Compressed DRAM for Data Prefetching
Author
Gang Liu ; Jih-Kwon Peir ; Lee, Victor
Author_Institution
Dept. of Comput. & Inf. Sci. & Eng, Univ. of Florida, Gainesville, FL, USA
fYear
2012
fDate
21-25 May 2012
Firstpage
691
Lastpage
702
Abstract
Cache misses frequently exhibit repeated streaming behavior, i.e. a sequence of cache misses has a high tendency of being repeated. Correlation-based prefetchers record the missing streams in a history table for accurate prefetching. Saving a large miss history in off-chip DRAM is a practical implementation, but incurs access latency and consumes memory bandwidth which leads to performance degradation. In this paper, we investigate a new data prefetching mechanism based on per-block miss correlation where a miss is correlated with an earlier miss when the two misses are closely encountered both in time and space. The miss correlations are captured dynamically and saved along with the content of the data block using a simple data compression technique. As a result of this novel combination, our scheme provides unbounded correlation history and its prefetch metadata can be fetched together with demand data without incurring additional latency nor consuming any memory bandwidth. Performance evaluations using data-parallel applications demonstrate that prefetchers based on per-block miss correlations can improve IPC by 42-139% with an average of 88% compared to the IPC without prefetching. In comparison with regular stream prefetcher, sampled temporal streaming prefetcher and spatial-temporal memory streaming prefetcher, up to 115%, 99% and 98% IPC improvement can be obtained with an average about 36%, 26% and 27% respectively.
Keywords
DRAM chips; cache storage; data compression; encoding; meta data; parallel processing; IPC improvement; access latency; compressed DRAM; correlation-based prefetchers; data compression technique; data prefetching; data-parallel applications; demand data; memory bandwidth; miss-correlation folding; missing streams; off-chip DRAM; per-block miss correlations encoding; performance degradation; prefetch metadata; regular stream prefetcher; repeated streaming behavior; sampled temporal streaming prefetcher; spatial-temporal memory streaming prefetcher; unbounded correlation history; Bandwidth; Correlation; Encoding; History; Memory management; Prefetching; Random access memory; cache; compress; data parallel; miss correlation; prefetch; spatial; temporal;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
Conference_Location
Shanghai
ISSN
1530-2075
Print_ISBN
978-1-4673-0975-2
Type
conf
DOI
10.1109/IPDPS.2012.68
Filename
6267870
Link To Document