DocumentCode :
1058485
Title :
Instruction window size trade-offs and characterization of program parallelism
Author :
Dubey, Pradeep K. ; Adams, George B., III ; Flynn, Michael J.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
43
Issue :
4
fYear :
1994
fDate :
4/1/1994 12:00:00 AM
Firstpage :
431
Lastpage :
442
Abstract :
Detecting independent operations is a prime objective for computers that are capable of issuing and executing multiple operations simultaneously. The number of instructions that are simultaneously examined for detecting those that are independent is the scope of concurrency detection. The authors present an analytical model for predicting the performance impact of varying the scope of concurrency detection as a function of available resources, such as number of pipelines in a superscalar architecture. The model developed can show where a performance bottleneck might be: insufficient resources to exploit discovered parallelism, insufficient instruction stream parallelism, or insufficient scope of concurrency detection. The cost associated with speculative execution is examined via a set of probability distributions that characterize the inherent parallelism in the instruction stream. These results were derived using traces from a Multiflow TRACE SCHEDULING compacting FORTRAN 77 and C compilers. The experiments provide misprediction delay estimates for 11 common application-level benchmarks under scope constraints, assuming speculative, out-of-order execution and run time scheduling. The throughput prediction of the analytical model is shown to be close to the measured static throughput of the compiler output
Keywords :
concurrency control; parallel programming; performance evaluation; program compilers; scheduling; Multiflow TRACE SCHEDULING; characterization; compilers; concurrency detection; delay estimates; inherent parallelism; instruction stream parallelism; instruction window size; parallelism; performance bottleneck; performance impact; probability distributions; program parallelism; run time scheduling; scope constraints; throughput prediction; trade-offs; Analytical models; Concurrent computing; Costs; Delay effects; Delay estimation; Out of order; Parallel processing; Pipelines; Probability distribution; Throughput;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/12.278481
Filename :
278481
Link To Document :
بازگشت