Title :
Exploring branch target buffer access filtering for low-energy and high-performance microarchitectures
Author :
Wang, Shuhui ; Hu, Jiankun ; Ziavras, Sotirios G.
Author_Institution :
Nat. Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
fDate :
1/1/2012 12:00:00 AM
Abstract :
Powerful branch predictors along with a large branch target buffer (BTB) are employed in superscalar and simultaneous multi-threading (SMT) processors for instruction-level parallelism and thread-level parallelism exploitation. However, the large BTB not only dominates the predictor energy consumption, but also becomes a major roadblock in achieving faster clock frequencies at deep sub-micron technologies. The authors propose here a filtering scheme to dramatically reduce the accesses to the BTB to achieve significantly reduced energy consumption in the BTB while maintaining the performance. For a simulated superscalar microprocessor, the experimental evaluation shows that the BTB access filtering (BAF) design achieves an 88.5% dynamic energy reduction with negligible performance loss. The authors also study the leakage behaviour and its control in the BAF design. The results show that by applying a drowsy strategy, very effective leakage control can be achieved. For the high-performance design, the BAF can also improve BTB%s performance scalability at new technologies. For the simultaneous multi-threading environment, the authors evaluate the effectiveness of the BAF design and propose a banked BAF (BK-BAF) scheme to further reduce the energy consumption and performance overhead. The experimental results confirm that the BK-BAF scheme can be an energy/performance-effective design for next generation SMT processors.
Keywords :
channel bank filters; clocks; energy consumption; multi-threading; multiprocessing systems; parallel architectures; BTB access filtering design; banked BAF scheme; branch target buffer access filtering; deep sub-micron technologies; dynamic energy reduction; energy consumption; filtering scheme; high-performance microarchitectures; instruction-level parallelism; low-energy microarchitectures; simulated superscalar microprocessor; simultaneous multi-threading processors; thread-level parallelism exploitation;
Journal_Title :
Computers & Digital Techniques, IET
DOI :
10.1049/iet-cdt.2010.0102