DocumentCode :
2958770
Title :
Miss-Correlation Folding: Encoding Per-Block Miss Correlations in Compressed DRAM for Data Prefetching
Author :
Gang Liu ; Jih-Kwon Peir ; Lee, Victor
Author_Institution :
Dept. of Comput. & Inf. Sci. & Eng, Univ. of Florida, Gainesville, FL, USA
fYear :
2012
fDate :
21-25 May 2012
Firstpage :
691
Lastpage :
702
Abstract :
Cache misses frequently exhibit repeated streaming behavior, i.e. a sequence of cache misses has a high tendency of being repeated. Correlation-based prefetchers record the missing streams in a history table for accurate prefetching. Saving a large miss history in off-chip DRAM is a practical implementation, but incurs access latency and consumes memory bandwidth which leads to performance degradation. In this paper, we investigate a new data prefetching mechanism based on per-block miss correlation where a miss is correlated with an earlier miss when the two misses are closely encountered both in time and space. The miss correlations are captured dynamically and saved along with the content of the data block using a simple data compression technique. As a result of this novel combination, our scheme provides unbounded correlation history and its prefetch metadata can be fetched together with demand data without incurring additional latency nor consuming any memory bandwidth. Performance evaluations using data-parallel applications demonstrate that prefetchers based on per-block miss correlations can improve IPC by 42-139% with an average of 88% compared to the IPC without prefetching. In comparison with regular stream prefetcher, sampled temporal streaming prefetcher and spatial-temporal memory streaming prefetcher, up to 115%, 99% and 98% IPC improvement can be obtained with an average about 36%, 26% and 27% respectively.
Keywords :
DRAM chips; cache storage; data compression; encoding; meta data; parallel processing; IPC improvement; access latency; compressed DRAM; correlation-based prefetchers; data compression technique; data prefetching; data-parallel applications; demand data; memory bandwidth; miss-correlation folding; missing streams; off-chip DRAM; per-block miss correlations encoding; performance degradation; prefetch metadata; regular stream prefetcher; repeated streaming behavior; sampled temporal streaming prefetcher; spatial-temporal memory streaming prefetcher; unbounded correlation history; Bandwidth; Correlation; Encoding; History; Memory management; Prefetching; Random access memory; cache; compress; data parallel; miss correlation; prefetch; spatial; temporal;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
Conference_Location :
Shanghai
ISSN :
1530-2075
Print_ISBN :
978-1-4673-0975-2
Type :
conf
DOI :
10.1109/IPDPS.2012.68
Filename :
6267870
Link To Document :
بازگشت