• DocumentCode
    1242101
  • Title

    Evaluating performance tradeoffs between fine-grained and coarse-grained alternatives

  • Author

    Dubey, Pradeep IS ; Adams, George B., III ; Flynn, M.J.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    6
  • Issue
    1
  • fYear
    1995
  • fDate
    1/1/1995 12:00:00 AM
  • Firstpage
    17
  • Lastpage
    27
  • Abstract
    Recent simulation based studies suggest that while superpipelines and superscalars are equally capable of exploiting fine grained concurrency, multiprocessors are better at exploiting coarse grained parallelism. An analytical model that is more flexible and less costly in terms of run time than simulation, is proposed as a tool for analyzing the tradeoff between superpipelined processors, superscalar processors, and multiprocessors. The duality of superpipelines and superscalars is examined in detail. The performance limit for these systems has been derived and it supports the fetch bottleneck observation of previous researchers. Common characteristics of utilization curves for such systems are examined. Combined systems, such as superpipelined multiprocessors and superscalar multiprocessors, are also analyzed. The model shows that the number of pipelines (or processors) at which the maximum throughput is obtained is, as memory access time increases, increasingly sensitive to the ratio of memory access time to network access delay. Further, as a function of interiteration dependence distance, optimum throughput is shown to vary nonlinearly, whereas the corresponding optimum number of processors varies linearly. The predictions from the analytical model agree with similar results published using simulation based techniques
  • Keywords
    multiprocessing systems; parallel architectures; performance evaluation; pipeline processing; coarse-grained alternatives; duality; fetch bottleneck observation; interiteration dependence distance; memory access time; multiprocessors; network access delay; performance limit; performance tradeoffs; simulation based studies; superpipelines; superscalar processors; utilization curves; Analytical models; Assembly; Clocks; Concurrent computing; Delay effects; Microprocessors; NASA; Pipelines; Predictive models; Throughput;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.363414
  • Filename
    363414