• DocumentCode
    1992569
  • Title

    Quantifying instruction-level parallelism limits on an EPIC architecture

  • Author

    Lee, Hsien-Hsin ; Wu, Youfeng ; Tyson, Gary

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    21
  • Lastpage
    27
  • Abstract
    EPIC architectures rely heavily on state-of-the-art compiler technology to deliver optimal performance while keeping hardware design simple. It is generally believed that an optimizing compiler has an enormous scheduling window to exploit instruction-level parallelism (ILP) since the compiler orchestrates the entire program. Many state-of-the-art compilers typically confine optimizations to loop boundaries (e.g. software pipelining, trace scheduling, and loop unrolling) and function boundaries (e.g. loop peeling, loop exchanges, invariant hoisting, and global optimizations). Although techniques such as function inlining and interprocedural optimizations can alleviate these constraints to a limited extent, loop and function boundaries are often the real scopes of the compiler scheduler. Several previous ILP studies have explored the limits of parallelism on dynamic superscalar machines; however, those results are not applicable to EPIC architectures since they rely on dynamic scheduling, not static code scheduling by the compiler, to reorder instructions. In this paper, we evaluate the limits in ILP obtained through compiler scheduling alone. We quantify these limits as more restrictive scheduling constraints are imposed-starting from inter-procedural code scheduling, to intra-procedural and finally to loop-confined code scheduling
  • Keywords
    optimising compilers; parallel programming; processor scheduling; EPIC architecture; function boundaries; instruction-level parallelism; instruction-level parallelism limits; inter-procedural code scheduling; intra-procedural code scheduling; loop boundaries; loop-confined code scheduling; optimal performance; optimizing compiler; static code schedule; Delay effects; Hardware; Instruction sets; Logic; Pipeline processing; Processor scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Performance Analysis of Systems and Software, 2000. ISPASS. 2000 IEEE International Symposium on
  • Conference_Location
    Austin, TX
  • Print_ISBN
    0-7803-6418-X
  • Type

    conf

  • DOI
    10.1109/ISPASS.2000.842276
  • Filename
    842276