• DocumentCode
    3712398
  • Title

    Exploring early and late ALUs for single-issue in-order pipelines

  • Author

    Alen Bardizbanyan;Per Larsson-Edefors

  • Author_Institution
    Chalmers University of Technology, Gothenburg, Sweden
  • fYear
    2015
  • Firstpage
    543
  • Lastpage
    548
  • Abstract
    In-order processors are key components in energy-efficient embedded systems. One important design aspect of inorder pipelines is the sequence of pipeline stages: First, the position of the execute stage, in which arithmetic logic unit (ALU) operations and branch prediction are handled, impacts the number of stall cycles that are caused by data dependencies between data memory instructions and their consuming instructions and by address generation instructions that depend on an ALU result. Second, the position of the ALU inside the pipeline impacts the branch penalty. This paper considers the question on how to best make use of ALU resources inside a single-issue in-order pipeline. We begin by analyzing which is the most efficient way of placing a single ALU in an in-order pipeline. We then go on to evaluate what is the most efficient way to make use of two ALUs, one early and one late ALU, which is a technique that has revitalized commercial in-order processors in recent years. Our architectural simulations, which are based on 20 MiBench and 7 SPEC2000 integer benchmarks and a 65-nm postlayout netlist of a complete pipeline, show that utilizing two ALUs in different stages of the pipeline gives better performance and energy efficiency than any other pipeline configuration with a single ALU.
  • Keywords
    "Pipelines","Program processors","Benchmark testing","Clocks","Optimization","Integrated circuit modeling"
  • Publisher
    ieee
  • Conference_Titel
    Computer Design (ICCD), 2015 33rd IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/ICCD.2015.7357163
  • Filename
    7357163