• DocumentCode
    2540622
  • Title

    Distributed modulo scheduling

  • Author

    Fernandes, Marcio Merino ; Llosa, Josep ; Topham, Nigel

  • Author_Institution
    Dept. of Comput. Sci., Edinburgh Univ., UK
  • fYear
    1999
  • fDate
    9-13 Jan 1999
  • Firstpage
    130
  • Lastpage
    134
  • Abstract
    Wide-issue ILP machines can be built using the VLIW approach as many of the hardware complexities found in superscalar processors can be transferred to the compiler. However, the scalability of VLIW architectures is still constrained by the size and number of ports of the register file required by a large number of functional units. Organizations composed of clusters of a few functional units and small private register files have been proposed to deal with this problem; an approach highly dependent on scheduling and partitioning strategies. The paper presents DMS, an algorithm that integrates modulo scheduling and code partitioning in a single procedure. Experimental results have shown that the algorithm is effective for configurations up to 8 clusters, or even more when targeting vectorizable loops
  • Keywords
    distributed algorithms; instruction sets; parallel architectures; parallel programming; processor scheduling; program compilers; DMS; VLIW approach; VLIW architectures; code partitioning; compiler; distributed modulo scheduling; functional units; hardware complexities; partitioning strategies; private register files; register file; superscalar processors; vectorizable loops; wide-issue ILP machines; Clocks; Computer architecture; Computer science; Data analysis; Microprocessors; Pipeline processing; Processor scheduling; Radio frequency; Registers; VLIW;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High-Performance Computer Architecture, 1999. Proceedings. Fifth International Symposium On
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    0-7695-0004-8
  • Type

    conf

  • DOI
    10.1109/HPCA.1999.744349
  • Filename
    744349