• DocumentCode
    796803
  • Title

    Architecture and Compiler Optimizations for Data Bandwidth Improvement in Configurable Processors

  • Author

    Cong, Jason ; Han, Guoling ; Zhang, Zhiru

  • Author_Institution
    Dept. of Comput. Sci., California Univ., Los Angeles, CA
  • Volume
    14
  • Issue
    9
  • fYear
    2006
  • Firstpage
    986
  • Lastpage
    997
  • Abstract
    Many commercially available embedded processors are capable of extending their base instruction set for a specific domain of applications. While steady progress has been made in the tools and methodologies of automatic instruction set extension for configurable processors, the limited data bandwidth available in the core processor (e.g., the number of simultaneous accesses to the register file) becomes a potential performance bottleneck. In this paper, we first present a quantitative analysis of the data bandwidth limitation in configurable processors, and then propose a novel low-cost architectural extension and associated compilation techniques to address the problem. Specifically, we embed a single control bit in the instruction op-codes to selectively copy the execution results to a set of hash-mapped shadow registers in the write-back stage. This can efficiently reduce the communication overhead due to data transfers between the core processor and the custom logic. We also present a novel simultaneous global shadow register binding with a hash function generation algorithm to take full advantage of the extension. The application of our approach leads to a nearly optimal performance speedup
  • Keywords
    optimising compilers; reconfigurable architectures; shift registers; architectural extension; base instruction set; compilation techniques; compiler optimizations; configurable processors; data bandwidth improvement; embedded processors; execution results; hash function generation algorithm; hash-mapped shadow registers; instruction op-codes; performance bottleneck; simultaneous global shadow register binding; write-back stage; Application specific processors; Bandwidth; Communication system control; Data analysis; Hardware; Microarchitecture; Optimizing compilers; Reconfigurable architectures; Reconfigurable logic; Registers; Data bandwidth optimization; optimizing compilers; reconfigurable architectures; register allocation;
  • fLanguage
    English
  • Journal_Title
    Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-8210
  • Type

    jour

  • DOI
    10.1109/TVLSI.2006.884050
  • Filename
    1715331