DocumentCode
2010291
Title
Performance evaluation of tiling for the register level
Author
Jimenez, M. ; Llaberia, J.M. ; Fernandez, A.
Author_Institution
Dept. d´´Arquitectura de Computadors, Univ. Politecnica de Catalunya, Barcelona, Spain
fYear
1998
fDate
1-4 Feb 1998
Firstpage
254
Lastpage
265
Abstract
Tiling is a well-known loop transformation, which is basically used to expose coarse-grain parallelism and to exploit data reuse at the cache level. However, it can also be used to exploit data reuse at the register level and to improve programs´s ILP. Previous work on tiling and also commercial compilers are able to perform tiling for the register level in more than one dimension when the iteration space is rectangular. Non-rectangular iteration spaces are commonly found in linear algebra algorithms or can arise as a result of applying previous transformations such as loop skewing. In this paper we evaluate the technique presented in Jimenez et al. (1996) which is able to perform tiling for the register level in more than one dimension in both rectangular and non-rectangular iteration spaces. We use typical linear algebra algorithms having non-rectangular iteration spaces as benchmarks and compare our proposal against commercial preprocessors able to perform optimizing code transformations such as inner unrolling, outer unrolling and software pipelining. We will also present quantitative data showing the benefits of tiling only for the register level, tiling only for the cache level and tiling for both levels simultaneously. Results measured on a ALPHA 21164 processor show that tiling for both cache and register levels improves upon commercial compilers and preprocessors by factors in the range of 1.3 to 6.3
Keywords
performance evaluation; pipeline processing; ALPHA 21164 processor; cache level; data reuse; loop transformation; performance evaluation; register level; tiling; Computer architecture; Linear algebra; Memory management; Pipeline processing; Proposals; Registers; Scheduling; Software performance; Tiles;
fLanguage
English
Publisher
ieee
Conference_Titel
High-Performance Computer Architecture, 1998. Proceedings., 1998 Fourth International Symposium on
Conference_Location
Las Vegas, NV
Print_ISBN
0-8186-8323-6
Type
conf
DOI
10.1109/HPCA.1998.650564
Filename
650564
Link To Document