DocumentCode
1474391
Title
High bandwidth on-chip cache design
Author
Wilson, Kenneth M. ; Olukotun, Kunle
Author_Institution
Hewlett-Packard Co., Palo Alto, CA, USA
Volume
50
Issue
4
fYear
2001
fDate
4/1/2001 12:00:00 AM
Firstpage
292
Lastpage
307
Abstract
In this paper, we evaluate the performance of high bandwidth cache organizations employing multiple cache ports, multiple cycle hit times, and cache port efficiency enhancements, such as load all and line buffer, to find the organization that provides the best processor performance. Using a dynamic superscalar processor running realistic benchmarks that include operating system references, we use execution time to measure processor performance. When the cache is limited to a single cache port without enhancements, we find that two cache ports increase processor performance by 25 percent. With the addition of line buffer and load all to a single pelted cache, the processor achieves 91 percent of the performance of the same processor containing a cache with two ports. When the processor is not limited to a single cache port, the results show that a large dual-ported multicycle pipelined SRAM cache with a line buffer maximizes processor performance. A large pipelined cache provides both a low miss rate and a high CPU clock frequency. Dual-porting the cache and using a line buffer provide the bandwidth needed by a dynamic superscalar processor. The line buffer makes the pipelined dual-ported cache the best option by increasing cache port bandwidth and hiding cache latency
Keywords
cache storage; memory architecture; cache design; dual-ported multicycle pipelined SRAM cache; high bandwidth cache; line buffer; performance; pipelined cache; Bandwidth; Buffer storage; Clocks; Delay; Frequency; Microprocessors; Operating systems; Random access memory; System-on-a-chip; Time measurement;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/12.919276
Filename
919276
Link To Document