• DocumentCode
    2535267
  • Title

    "Flea-flicker" multipass pipelining: an alternative to the high-power out-of-order offense

  • Author

    Barnes, Ronald D. ; Ryoo, Shane ; Hwu, Wen-Mei W.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., George Mason Univ., Fairfax, VA, USA
  • fYear
    2005
  • fDate
    12-16 Nov. 2005
  • Abstract
    As microprocessor designs become increasingly power- and complexity-conscious, future microarchitectures must decrease their reliance on expensive dynamic scheduling structures. While compilers have generally proven adept at planning useful static instruction-level parallelism, relying solely on the compiler's instruction execution arrangement performs poorly when cache misses occur, because variable latency is not well tolerated. This paper proposes a new microarchitectural model, multipass pipelining, that exploits meticulous compile-time scheduling on simple in-order hardware while achieving excellent cache miss tolerance through persistent advance preexecution beyond otherwise stalled instructions. The pipeline systematically makes multiple passes through instructions that follow a stalled instruction. Each pass increases the speed and energy efficiency of the subsequent ones by preserving computed results. The concept of multiple passes and successive improvement of efficiency across passes in a single pipeline distinguishes multipass pipelining from other runahead schemes. Simulation results show that the multipass technique achieves 77% of the cycle reduction of aggressive out-of-order execution relative to in-order execution. In addition, microarchitectural-level power simulation indicates that benefits of multipass are achieved at a fraction of the power overhead of full dynamic scheduling.
  • Keywords
    cache storage; microprocessor chips; pipeline processing; processor scheduling; program compilers; cache miss tolerance; compile-time scheduling; compilers; dynamic scheduling structures; instruction execution; instruction-level parallelism; microarchitectural model; microprocessor designs; multipass pipelining; Delay; Dynamic scheduling; Energy efficiency; Hardware; Microarchitecture; Microprocessors; Out of order; Parallel processing; Pipeline processing; Processor scheduling
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Microarchitecture, 2005. MICRO-38. Proceedings. 38th Annual IEEE/ACM International Symposium on
  • Print_ISBN
    0-7695-2440-0
  • Type

    conf

  • DOI
    10.1109/MICRO.2005.1
  • Filename
    1540970