DocumentCode
3029142
Title
Optimizing finite element programs on the Cray X1 using coloring schemes
Author
Tracy, Fred T.
Author_Institution
Eng. Res. & Dev. Center, USACE, Vicksburg, MS, USA
fYear
2004
fDate
7-11 June 2004
Firstpage
313
Lastpage
317
Abstract
Using the Environmental Quality Modeling program FEMWATER as a test-bed code, 27 percent of the time needed to run a given groundwater flow application on the ERDC Cray X1 using four multistream processors (MSPs) was spent assembling the global stiffness matrix. This poor performance is because the above code cannot multistream without help. The technique of "coloring" the elements makes it possible to multistream this section of the code, thus taking advantage of the hardware capability of the machine. Coloring for assembling the global stiffness matrix involves dividing the elements into different groups such that no node point touches any elements with the same color. This paper presents a simple coloring algorithm in FORTRAN and show how it was implemented into FEMWATER to achieve multistreaming on the ERDC Cray X1. It then gives a detailed description on how the program was modified, what compiler options were used, and what compiler directives worked best. Finally, timing results are given. Some programs that have good MPI (or equivalent) communication are better suited for running in the single-streamed processor (SSP) mode. In the SSP mode, coloring of the elements is not needed for assembling the global stiffness matrix. Timings for running in the SSP mode are shown, too.
Keywords
Cray computers; FORTRAN; Galerkin method; computational fluid dynamics; finite element analysis; groundwater; matrix algebra; message passing; multiprocessing systems; ERDC Cray X1; Environmental Quality Modeling program; FEMWATER; FORTRAN; MPI communication; coloring algorithm; compiler options; finite element programs; global stiffness matrix; groundwater flow application; multistream processors; single-streamed processor mode; Algorithms; Assembly; Finite element methods; Hardware; Nonlinear equations; Program processors; Research and development; Soil; Testing; Timing;
fLanguage
English
Publisher
ieee
Conference_Titel
Users Group Conference (DOD_UGC'04), 2004
Conference_Location
Williamsburg, VA, USA
Print_ISBN
0-7695-2259-9
Type
conf
DOI
10.1109/DOD_UGC.2004.31
Filename
1420891
Link To Document