DocumentCode :
424516
Title :
The Synergetic Effect of Compiler, Architecture, and Manual Optimizations on the Performance of CFD on Multiprocessors
Author :
Kuba, Masayuki ; Polychronopoulos, Constantine D. ; Gallivan, Kyle
Author_Institution :
University of Illinois at Urbana-Champaign
fYear :
1995
fDate :
1995
Firstpage :
72
Lastpage :
72
Abstract :
This paper discusses the comprehensive performance profiling, improvement and benchmarking of a Computational Fluid Dynamics code, one of the Grand Challenge applications, on three popular multiprocessors. In the process of analyzing performance we considered language, compiler, architecture, and algorithmic changes and quantified each of them and their incremental contribution to bottom-line performance. We demonstrate that parallelization alone cannot result in significant gains if the granularity of parallel threads and the effect of parallelization on data locality are not taken into account. Unlike benchmarking studies that often focus on the performance or effectiveness of parallelizing compilers on specific loop kernels, we used the entire CFD code to measure the global effectiveness of compilers and parallel architectures. We probed the performance bottlenecks in each case and derived solutions which eliminate or neutralize the performance inhibiting factors. The major conclusion of our work is that overall performance is extremely sensitive to the synergetic effects of compiler optimizations, algorithmic and code tuning, and architectural idiosyncrasies.
Keywords :
CFD (computational fluid dynamics); cache optimization; parallel algorithms and programs; parallel architectures; parallel processing; parallelizing compilers; performance evaluation; profiling and program tuning; program optimization; Algorithm design and analysis; Computational fluid dynamics; Optimizing compilers; Parallel algorithms; Parallel architectures; Parallel processing; Performance analysis; Permission; Program processors; Research and development; CFD (computational fluid dynamics); cache optimization; parallel algorithms and programs; parallel architectures; parallel processing; parallelizing compilers; performance evaluation; profiling and program tuning; program optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing, 1995. Proceedings of the IEEE/ACM SC95 Conference
Print_ISBN :
0-89791-816-9
Type :
conf
DOI :
10.1109/SUPERC.1995.242067
Filename :
1383209
Link To Document :
بازگشت