DocumentCode :
1366434
Title :
Hardware and software cache prefetching techniques for MPEG benchmarks
Author :
Zucker, Daniel F. ; Lee, Ruby B. ; Flynn, Michael J.
Author_Institution :
Dept. of Electr. Eng., Stanford Univ., CA, USA
Volume :
10
Issue :
5
fYear :
2000
fDate :
8/1/2000
Firstpage :
782
Lastpage :
796
Abstract :
With the popularity of multimedia acceleration instructions such as MMX, MPEG decompression is increasingly executed on general-purpose processors instead of dedicated MPEG hardware. The gap between processor speed and memory access speed means that a significant amount of time is spent in the memory system. As processors get faster (both in terms of higher clock speeds and increased instruction-level parallelism), the time spent in the memory system becomes even more significant. Data prefetching is a well-known technique for improving cache performance. While several studies have examined prefetch strategies for scientific and commercial applications, this paper focuses on video applications. Data is presented for three types of hardware prefetching schemes: the stream buffer, the stride prediction table (SPT), and the stream cache, as well as a new software-directed prefetching technique based on emulation of the hardware SPT. Up to 90% of the misses that would otherwise occur with no prefetching are eliminated. The stream cache can cut execution time by more than half with a relatively small amount of additional hardware. Software prefetching achieves nearly equal performance with minimal additional hardware. Techniques presented in this paper can be used to improve performance in a general-purpose CPU or an embedded MPEG processor. Performance gains achieved for MPEG benchmarks apply equally well to similar multimedia applications.
Keywords :
cache storage; code standards; data compression; decoding; embedded systems; multimedia computing; parallel processing; telecommunication standards; video coding; MMX; MPEG benchmarks; MPEG decompression; cache performance; clock speeds; data prefetching; embedded MPEG processor; execution time; general purpose processors; general-purpose CPU; hardware SPT emulation; hardware cache prefetching; instruction level parallelism; memory access; memory system; multimedia acceleration instructions; multimedia applications; performance gains; processor speed; software cache prefetching; software-directed prefetching; stream buffer; stream cache; stride prediction table; video applications; Acceleration; Application software; Clocks; Emulation; Hardware; Laboratories; Performance gain; Prefetching; Software performance; Streaming media;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
IEEE
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/76.856455
Filename :
856455