DocumentCode :
402323
Title :
Unrolling shape for out-of-order processors
Author :
Hiroyuki, Sato
Author_Institution :
Inf. Technol. Center, Tokyo Univ., Japan
fYear :
2003
fDate :
37819
Firstpage :
88
Lastpage :
97
Abstract :
Loop unrolling is today one of the most effective optimizations for modern architectures. To give an analytical model for loop unrolling performance, unrolling shape was proposed. It was applied to in-order processors, and was proved to give an accurate performance model for loop unrolling in term of software pipelining and cache miss alleviation. In this paper, we apply unrolling shape to out-of-order processors. A scheme for calculating PLOOO, pipelining terms of an unrolled loop by factor l are presented as PLOOO(l) = {(Nins(l)/F + NOccpy(l))}/l, where Nins(l) is the number of instructions in an unrolled loop by factor l, F the fetch rate of the architecture, NOccpy(l) the number of store instructions scheduled after Nins(l)/F-th cycle. A pipelining term for in-order processors is essential for calculating NOccpy(l). It is to be noted that the scheme for out-of-order processors uses unrolling shape for in-order processors. Experiments show that our scheme is precise in calculating the behaviour of loop unrolling on out-of-order processors. We show that our scheme quantitatively shows the effect of loop unrolling as the one of infinitely unrolled loops on in-order processors. Furthermore, we reveal that the old folklore that the loop unrolling reduces the loop overhead has revived on out-of-order processors as a performance improvement factor as d/dlPLOOO (Aho et al., 1986).
Keywords :
instruction sets; optimising compilers; pipeline processing; program control structures; analytical model; cache miss alleviation; computer architectures; in-order processors; instruction set; loop unrolling performance; optimization; out-of-order processors; performance model; software pipelining; Out of order; Shape;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovative Architecture for Future Generation High-Performance Processors and Systems, 2003
ISSN :
1537-3223
Print_ISBN :
0-7695-2019-7
Type :
conf
DOI :
10.1109/IWIA.2003.1262786
Filename :
1262786
Link To Document :
بازگشت