  • DocumentCode
    1366434
  • Title
    Hardware and software cache prefetching techniques for MPEG benchmarks
  • Author
    Zucker, Daniel F. ; Lee, Ruby B. ; Flynn, Michael J.
  • Author_Institution
    Dept. of Electr. Eng., Stanford Univ., CA, USA
  • Volume
    10
  • Issue
    5
  • fYear
    2000
  • fDate
    8/1/2000 12:00:00 AM
  • Firstpage
    782
  • Lastpage
    796
  • Abstract
    With the popularity of multimedia acceleration instructions such as MMX, MPEG decompression is increasingly executed on general-purpose processors instead of dedicated MPEG hardware. The gap between processor speed and memory access means that a significant amount of time is spent in the memory system. As processors get faster, both in terms of higher clock speeds and increased instruction-level parallelism, the time spent in the memory system becomes even more significant. Data prefetching is a well-known technique for improving cache performance. While several studies have examined prefetch strategies for scientific and commercial applications, this paper focuses on video applications. Data is presented for three types of hardware prefetching schemes: the stream buffer, the stride prediction table (SPT), and the stream cache, as well as a new software-directed prefetching technique based on emulation of the hardware SPT. Up to 90% of the misses that would otherwise occur with no prefetching are eliminated. The stream cache can cut execution time by more than half with a relatively small amount of additional hardware. Software prefetching achieves nearly equal performance with minimal additional hardware. Techniques presented in this paper can be used to improve performance in a general-purpose CPU or an embedded MPEG processor. Performance gains achieved for MPEG benchmarks apply equally effectively to similar multimedia applications.
  • Keywords
    cache storage; code standards; data compression; decoding; embedded systems; multimedia computing; parallel processing; telecommunication standards; video coding; MMX; MPEG benchmarks; MPEG decompression; cache performance; clock speeds; data prefetching; embedded MPEG processor; execution time; general purpose processors; general-purpose CPU; hardware SPT emulation; hardware cache prefetching; instruction level parallelism; memory access; memory system; multimedia acceleration instructions; multimedia applications; performance gains; processor speed; software cache prefetching; software-directed prefetching; stream buffer; stream cache; stride prediction table; video applications; Acceleration; Application software; Clocks; Emulation; Hardware; Laboratories; Performance gain; Prefetching; Software performance; Streaming media;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1051-8215
  • Type
    jour
  • DOI
    10.1109/76.856455
  • Filename
    856455