DocumentCode
1697571
Title
Can High Bandwidth and Latency Justify Large Cache Blocks in Scalable Multiprocessors?
Author
Bianchini, R. ; LeBlanc, T.J.
Author_Institution
University of Rochester, USA
Volume
1
fYear
1994
Firstpage
258
Lastpage
262
Abstract
An important architectural design decision affecting the performance of coherent caches is the choice of block size. There are two primary factors that influence this choice: the reference behavior of applications and the remote access bandwidth and latency of the machine. Given that we anticipate increases in both network bandwidth and latency (in processor cycles) in scalable shared-memory multiprocessors, the question arises as to what effect these increases will have on the choice of block size. We use analytical modeling and execution-driven simulation of parallel programs on a large-scale shared-memory machine to examine the relationship between cache block size and application performance as a function of remote access bandwidth and latency. We show that even under assumptions of high remote access bandwidth and latency, the best application performance usually results from using cache blocks between S2 and 128 bytes in size. We also show that modifying the program to remove the dominant source of misses may not increase the best performing block size. We conclude that large cache blocks cannot be justified in most realistic scenarios.
Keywords
Bandwidth; Coherence; Computer architecture; Costs; Delay; Impedance; Memory architecture; Parallel processing; Read-write memory; System performance;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing, 1994. Vol. 1. ICPP 1994. International Conference on
Conference_Location
North Carolina State University, NC, USA
ISSN
0190-3918
Print_ISBN
0-8493-2493-9
Type
conf
DOI
10.1109/ICPP.1994.66
Filename
4115727
Link To Document