DocumentCode
2193591
Title
Improving trace cache effectiveness with branch promotion and trace packing
Author
Patel, Sanjay Jeram ; Evers, Marius ; Patt, Yale N.
Author_Institution
Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
fYear
1998
fDate
27 Jun-1 Jul 1998
Firstpage
262
Lastpage
271
Abstract
The increasing widths of superscalar processors are placing greater demands upon the fetch mechanism. The trace cache meets these demands by placing logically contiguous instructions in physically contiguous storage. As a result, the trace cache delivers instructions at a high rate by supplying multiple fetch blocks each cycle. In this paper we examine two techniques to improve the number of instructions delivered each cycle by the trace cache. The first technique, branch promotion, dynamically converts strongly biased branches into branches with static predictions. Because these promoted branches require no dynamic prediction, the branch predictor suffers less from the negative effects of interference. Branch promotion unlocks the potential of the second technique: trace packing. With trace packing, trace segments are packed with as many instructions as will fit, without regard to naturally occurring fetch block boundaries. With both techniques, the effective fetch rate of the trace cache jumps up 17% over a trace cache which implements neither on a machine where the execution engine has a very aggressive memory disambiguator; the performance of a machine using branch promotion and trace packing is on average 11% higher than a machine using neither technique
Keywords
parallel architectures; performance evaluation; branch promotion; fetch mechanism; logically contiguous instructions; multiple fetch blocks; physically contiguous storage; superscalar processors; trace cache effectiveness; trace packing; trace segments; Bandwidth; Cache storage; Computer architecture; Decoding; Electrical capacitance tomography; Electronic switching systems; Engines; Etching; Laboratories; Read only memory;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Architecture, 1998. Proceedings. The 25th Annual International Symposium on
Conference_Location
Barcelona
ISSN
1063-6897
Print_ISBN
0-8186-8491-7
Type
conf
DOI
10.1109/ISCA.1998.694786
Filename
694786
Link To Document