DocumentCode :
1647534
Title :
On the performance of hand vs. automatically optimized numerical codes
Author :
Jimenez, M. ; Llaberia, J.M. ; Fernandez, Alicia
Author_Institution :
Dept. d´´Arquitectura de Comput. Univ. Politecnica de Catalunya, Barcelona, Spain
fYear :
2000
fDate :
6/22/1905 12:00:00 AM
Firstpage :
183
Lastpage :
194
Abstract :
In this paper, we compare automatic-optimized codes against hand-optimized codes. The automatic-optimized codes have been generated using our own developed tool that implements compiler techniques proposed in our previous work. Our compiler techniques focus on applying multilevel tiling to non-rectangular loop nests. This type of loop nests are commonly found in linear algebra algorithms, typically used in numerical codes. As hand-optimized codes, we use two different numerical libraries: the BLAS3 library provided by the manufacturers and the RISC-BLAS library proposed in Dayde and Duff (1998). Results will show how compiler technology can make it possible for non-rectangular loop nests to achieve as high performance as hand-optimized codes on modern microprocessors
Keywords :
numerical analysis; optimising compilers; software performance evaluation; BLAS3; RISC-BLAS library; automatic-optimized codes; compiler techniques; compiler technology; hand-optimized codes; multilevel tiling; non-rectangular loop nests; optimized codes; Feeds; Microarchitecture; Microprocessors; Multidimensional systems; Pipeline processing; Registers; Shape; Tiles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High-Performance Computer Architecture, 2000. HPCA-6. Proceedings. Sixth International Symposium on
Conference_Location :
Touluse
Print_ISBN :
0-7695-0550-3
Type :
conf
DOI :
10.1109/HPCA.2000.824349
Filename :
824349
Link To Document :
بازگشت