DocumentCode :
2593104
Title :
Accelerating CFD applications by improving cached data reuse
Author :
Kadambi, Sudarshan ; Harden, Jim C.
Author_Institution :
NSF Eng. Res. Center for Comput. Field Simulation, Mississippi State Univ., MS, USA
fYear :
1995
fDate :
12-14 Mar 1995
Firstpage :
120
Lastpage :
124
Abstract :
As processors continue to experience relatively rapid clock speed increases, the gap widens between CPU and memory performance. Unlike other studies that collect memory traces and analyze them for compile time optimization or propose cache organization best suited for an application group, this paper tackles the problem at its roots, namely analyzing data access patterns and optimizing them before implementation. Optimization done by today's compilers is mostly loop level. Function level optimization is limited to inlining code that often leads to poor instruction cache utilization, affecting code performance adversely. In this study, an algorithm to solve compressible Euler equations is studied with regard to temporal and spatial access of data. Data and instruction blocks, which are used most often, are isolated. The algorithm is then coded to utilize the characteristics of hierarchical memories with as much as 45% improvement over conventional optimization techniques.
Keywords :
cache storage; fluid dynamics; mathematics computing; optimising compilers; performance evaluation; physics computing; software performance evaluation; CFD applications; cache organization; code performance; compile time optimization; compilers; compressible Euler equations; computational fluid dynamics; cpu; data access patterns; function level optimization; hierarchical memories; improving cached data reuse; instruction blocks; instruction cache utilization; loop level optimisation; memory performance; memory traces; optimization; optimization techniques; rapid clock speed increase; spatial access; temporal access; Acceleration; Clocks; Computational fluid dynamics; Computational modeling; Costs; Data analysis; Equations; Parallel processing; Pattern analysis; Pipeline processing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
System Theory, 1995., Proceedings of the Twenty-Seventh Southeastern Symposium on
Conference_Location :
Starkville, MS
ISSN :
0094-2898
Print_ISBN :
0-8186-6985-3
Type :
conf
DOI :
10.1109/SSST.1995.390606
Filename :
390606