• DocumentCode
    1282129
  • Title

    Branch prediction, instruction-window size, and cache size: performance trade-offs and simulation techniques

  • Author

    Skadron, Kevin ; Ahuja, Pritpal S. ; Martonosi, Margaret ; Clark, Douglas W.

  • Author_Institution
    Dept. of Comput. Sci., Virginia Univ., Charlottesville, VA, USA
  • Volume
    48
  • Issue
    11
  • fYear
    1999
  • fDate
    11/1/1999 12:00:00 AM
  • Firstpage
    1260
  • Lastpage
    1281
  • Abstract
    Design parameters interact in complex ways in modern processors, especially because out-of-order issue and decoupling buffers allow latencies to be overlapped. Trade-offs among instruction-window size, branch-prediction accuracy, and instruction- and data-cache size can change as these parameters move through different domains. For example, modeling unrealistic caches can under- or overstate the benefits of better prediction or a larger instruction window. Avoiding such pitfalls requires understanding how all these parameters interact. Because such methodological mistakes are common, this paper provides a comprehensive set of SimpleScalar simulation results from SPECint95 programs, showing the interactions among these major structures. In addition to presenting this database of simulation results, major mechanisms driving the observed trade-offs are described. The paper also considers appropriate simulation techniques when sampling full-length runs with the SPEC reference inputs. In particular, the results show that branch mispredictions limit the benefits of larger instruction windows, that better branch prediction and better instruction cache behavior have synergistic effects, and that the benefits of larger instruction windows and larger data caches trade off and have overlapping effects. In addition, simulations of only 50 million instructions can yield representative results if these short windows are carefully selected
  • Keywords
    cache storage; computer architecture; digital simulation; performance evaluation; SPECint95 programs; SimpleScalar simulation; branch prediction; branch-prediction accuracy; cache size; decoupling buffers; instruction-window size; performance tradeoffs; simulation techniques; Computer science; Databases; Delay; Marine vehicles; Microarchitecture; Out of order; Predictive models; Sampling methods; Senior members; Tellurium;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/12.811115
  • Filename
    811115