DocumentCode :
2065910
Title :
Fast, Performance-Optimized Partial Match Address Compression for Low-Latency On-Chip Address Buses
Author :
Liu, Jiangjiang ; Sundaresan, Krishnan ; Mahapatra, Nihar R.
Author_Institution :
Lamar Univ., Beaumont
fYear :
2007
fDate :
1-4 Oct. 2007
Firstpage :
17
Lastpage :
24
Abstract :
The influence of interconnects on processor performance and cost is becoming increasingly pronounced with technology scaling. In this paper, we present a fast compression scheme that exploits the spatial and temporal locality of addresses to dynamically compress them to different extents depending upon the extent to which they match the higher-order portions of recently-occurring addresses saved in a very small "compression cache" of capacity less than 500 bits. When a maximal match occurs, the address is compressed to the maximum extent and is transmitted on a narrow bus in one cycle. When a partial match occurs, one or more extra cycles are required for address transmission depending upon the extent of the partial match. To minimize this transmission cycle penalty (TCP), we use an efficient algorithm to determine the optimal set of partial matches to be supported in our partial match compression (PMC) scheme - we refer to this scheme as performance-optimized PMC (PO-PMC). A previously-proposed scheme called bus expander (BE) supports only a single, fixed-size match for compression. We show that all addresses that result in (maximal) matches in BE also result in the same in PMC, but the remaining addresses that are considered "no matches" in BE frequently result in partial matches in PMC, thus helping curtail the latter\´s TCP significantly. Across many SPEC CPU2000 integer and floating-point benchmarks, we find that average program performance improves by 3% when using PO-PMC compared to that when using BE. Further, we investigate how area slack arising from compression can be exploited for bus latency improvement by increasing inter-wire spacing. We find that, on the average, it can reduce bus latency by up to 84.63% and thereby improve program performance by about 16%.
Keywords :
cache storage; field buses; microprocessor chips; SPEC CPU2000; bus expander; compression cache; floating-point benchmarks; integer benchmarks; inter-wire spacing; low-latency on-chip address buses; partial match compression; performance-optimized PMC; performance-optimized partial match address compression; transmission cycle penalty; Clocks; Costs; Delay; Frequency; Integrated circuit interconnections; LAN interconnection; Microarchitecture; Registers; Wires; Wiring;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Design, 2006. ICCD 2006. International Conference on
Conference_Location :
San Jose, CA
ISSN :
1063-6404
Print_ISBN :
978-0-7803-9707-1
Electronic_ISBN :
1063-6404
Type :
conf
DOI :
10.1109/ICCD.2006.4380788
Filename :
4380788
Link To Document :
بازگشت