DocumentCode :
2267953
Title :
Evaluating the Memory System Performance of Software-Initiated Inter-core LLC Prepushing
Author :
Cai, Min ; Gu, Zhimin
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing, China
fYear :
2011
fDate :
26-28 May 2011
Firstpage :
216
Lastpage :
221
Abstract :
Data prefetching speculatively issues memory requests for data needed later by the main computation, and can therefore increase stress on the limited resources of chip multiprocessors. If not used properly, it can cause harmful effects such as cache pollution and wasted bandwidth. Accurate, fine-grained measurement of the related runtime metrics is therefore an important first step in reducing harmful prefetches and increasing memory-level parallelism (MLP) on chip multiprocessors. However, the required measurement is infeasible on real machines without introducing nontrivial performance overhead, which in turn leads to inaccurate results. In this paper, we use cycle-accurate full-system simulation to study the memory system performance of our previously proposed data prefetching technique with control of harmful prefetches on chip multiprocessors: software-initiated inter-core LLC prepushing. We modified the GEMS multiprocessor simulator to support trace-based measurement and offline analysis of MLP, DRAM BLP, and their relationship with software-initiated inter-core LLC prepushing. Results show that prepushing achieves speedups of 1.628, 1.019 and 1.032 in mst, em3d and 429.mcf, respectively. Average L2 MLP increases by 26% and 0.3% in mst and em3d, respectively, and decreases by 1% in 429.mcf.
Keywords :
DRAM chips; microprocessor chips; storage management; DRAM BLP; GEMS multiprocessor simulator; MLP; cycle accurate full-system simulation; data prefetching; memory level parallelism; memory system performance evaluation; on chip multiprocessors; software-initiated intercore LLC Prepushing; Measurement; Multicore processing; Object oriented modeling; Prefetching; Random access memory; System performance; architectural simulation; chip multiprocessors; data prefetching; memory system performance;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Processing with Applications Workshops (ISPAW), 2011 Ninth IEEE International Symposium on
Conference_Location :
Busan
Print_ISBN :
978-1-4577-0524-3
Electronic_ISBN :
978-0-7695-4429-8
Type :
conf
DOI :
10.1109/ISPAW.2011.56
Filename :
5951977