DocumentCode
1058485
Title
Instruction window size trade-offs and characterization of program parallelism
Author
Dubey, Pradeep K. ; Adams, George B., III ; Flynn, Michael J.
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
43
Issue
4
fYear
1994
fDate
4/1/1994 12:00:00 AM
Firstpage
431
Lastpage
442
Abstract
Detecting independent operations is a prime objective for computers that are capable of issuing and executing multiple operations simultaneously. The number of instructions that are simultaneously examined for detecting those that are independent is the scope of concurrency detection. The authors present an analytical model for predicting the performance impact of varying the scope of concurrency detection as a function of available resources, such as number of pipelines in a superscalar architecture. The model developed can show where a performance bottleneck might be: insufficient resources to exploit discovered parallelism, insufficient instruction stream parallelism, or insufficient scope of concurrency detection. The cost associated with speculative execution is examined via a set of probability distributions that characterize the inherent parallelism in the instruction stream. These results were derived using traces from a Multiflow TRACE SCHEDULING compacting FORTRAN 77 and C compilers. The experiments provide misprediction delay estimates for 11 common application-level benchmarks under scope constraints, assuming speculative, out-of-order execution and run time scheduling. The throughput prediction of the analytical model is shown to be close to the measured static throughput of the compiler output
Keywords
concurrency control; parallel programming; performance evaluation; program compilers; scheduling; Multiflow TRACE SCHEDULING; characterization; compilers; concurrency detection; delay estimates; inherent parallelism; instruction stream parallelism; instruction window size; parallelism; performance bottleneck; performance impact; probability distributions; program parallelism; run time scheduling; scope constraints; throughput prediction; trade-offs; Analytical models; Concurrent computing; Costs; Delay effects; Delay estimation; Out of order; Parallel processing; Pipelines; Probability distribution; Throughput;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/12.278481
Filename
278481
Link To Document