• DocumentCode
    3029142
  • Title
    Optimizing finite element programs on the Cray X1 using coloring schemes
  • Author
    Tracy, Fred T.
  • Author_Institution
    Eng. Res. & Dev. Center, USACE, Vicksburg, MS, USA
  • fYear
    2004
  • fDate
    7-11 June 2004
  • Firstpage
    313
  • Lastpage
    317
  • Abstract
    With the Environmental Quality Modeling program FEMWATER as a test-bed code, 27 percent of the time needed to run a given groundwater flow application on the ERDC Cray X1 using four multistream processors (MSPs) was spent assembling the global stiffness matrix. This poor performance arises because the assembly code cannot multistream without help. The technique of "coloring" the elements makes it possible to multistream this section of the code, thus taking advantage of the hardware capability of the machine. Coloring for assembling the global stiffness matrix involves dividing the elements into groups (colors) such that no two elements of the same color share a node point. This paper presents a simple coloring algorithm in FORTRAN and shows how it was implemented in FEMWATER to achieve multistreaming on the ERDC Cray X1. It then gives a detailed description of how the program was modified, what compiler options were used, and what compiler directives worked best. Finally, timing results are given. Some programs that have good MPI (or equivalent) communication are better suited to running in the single-streamed processor (SSP) mode. In the SSP mode, coloring of the elements is not needed for assembling the global stiffness matrix. Timings for running in the SSP mode are also shown.
  • Keywords
    Cray computers; FORTRAN; Galerkin method; computational fluid dynamics; finite element analysis; groundwater; matrix algebra; message passing; multiprocessing systems; ERDC Cray X1; Environmental Quality Modeling program; FEMWATER; FORTRAN; MPI communication; coloring algorithm; compiler options; finite element programs; global stiffness matrix; groundwater flow application; multistream processors; single-streamed processor mode; Algorithms; Assembly; Finite element methods; Hardware; Nonlinear equations; Program processors; Research and development; Soil; Testing; Timing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Users Group Conference (DOD_UGC'04), 2004
  • Conference_Location
    Williamsburg, VA, USA
  • Print_ISBN
    0-7695-2259-9
  • Type
    conf
  • DOI
    10.1109/DOD_UGC.2004.31
  • Filename
    1420891
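
The coloring idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's FORTRAN algorithm, which this record does not reproduce; it is a generic greedy coloring written in Python, and the names `color_elements` and `group_by_color` and the small example mesh are hypothetical. It assumes each element is given as a tuple of node numbers, and it assigns each element the lowest color not already used by any element sharing one of its nodes.

```python
def color_elements(elements):
    """Greedy element coloring (illustrative sketch, not the paper's code).

    elements: list of tuples, each holding the node numbers of one element.
    Returns a list giving the color (0, 1, 2, ...) of each element, such
    that no two elements of the same color share a node.
    """
    node_colors = {}  # node number -> colors of elements already touching it
    colors = []
    for nodes in elements:
        # Collect every color already used at any node of this element.
        used = set()
        for n in nodes:
            used |= node_colors.get(n, set())
        # Pick the lowest color that is still free at all of its nodes.
        c = 0
        while c in used:
            c += 1
        colors.append(c)
        for n in nodes:
            node_colors.setdefault(n, set()).add(c)
    return colors


def group_by_color(colors):
    """Return element indices grouped by color, lowest color first."""
    groups = {}
    for element, c in enumerate(colors):
        groups.setdefault(c, []).append(element)
    return [groups[c] for c in sorted(groups)]


# Hypothetical mesh: a chain of four two-node elements.
mesh = [(0, 1), (1, 2), (2, 3), (3, 4)]
mesh_colors = color_elements(mesh)
mesh_groups = group_by_color(mesh_colors)
```

Within one color group, no two elements write to the same row of the global stiffness matrix, so the loop over that group has no write conflicts and is a candidate for parallel execution; the groups themselves are processed one after another.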