Title of article :
A general parallel sparse-blocked matrix multiply for linear scaling SCF theory Original Research Article
Author/Authors :
Matt Challacombe، نويسنده ,
Issue Information :
دوهفته نامه با شماره پیاپی سال 2000
Pages :
15
From page :
93
To page :
107
Abstract :
A general approach to the parallel sparse-blocked matrix–matrix multiply is developed in the context of linear scaling self-consistent-field (SCF) theory. The data-parallel message passing method uses non-blocking communication to overlap computation and communication. The space filling curve heuristic is used to achieve data locality for sparse matrix elements that decay with “separation”. Load balance is achieved by solving the bin packing problem for blocks with variable size. With this new method as the kernel, parallel performance of the simplified density matrix minimization (SDMM) for solution of the SCF equations is investigated for RHF/6-31G ∗∗ water clusters and RHF/3-21G estane globules. Sustained rates above 5.7 GFLOPS for the SDMM have been achieved for (H 2 O) 200 with 95 Origin 2000 processors. Scalability is found to be limited by load imbalance, which increases with decreasing granularity, due primarily to the inhomogeneous distribution of variable block sizes.
Journal title :
Computer Physics Communications
Serial Year :
2000
Journal title :
Computer Physics Communications
Record number :
1135380
Link To Document :
بازگشت