DocumentCode
3111131
Title
Communication Avoiding Gaussian elimination
Author
Grigori, Laura ; Demmel, James ; Xiang, Hia
Author_Institution
INRIA Saclay-Ile de France, Univ. Paris-Sud 11, Orsay, France
fYear
2008
fDate
15-21 Nov. 2008
Firstpage
1
Lastpage
12
Abstract
We present CALU, a Communication Avoiding algorithm for the LU factorization of dense matrices distributed in a two-dimensional cyclic layout. The algorithm is based on a new pivoting strategy, which is stable in practice. The new algorithm is optimal (up to polylogarithmic factors) in the amount of communication it performs. Our experiments show that CALU leads to a reduction in the parallel time, in particular when the latency time is an important factor of the overall time. The factorization of a block-column, a subroutine of CALU, outperforms the corresponding routine PDGETF2 from ScaLAPACK up to a factor of 4.37 on an IBM POWER 5 system and up to a factor of 5.58 on a Cray XT4 system. On square matrices of order 104, CALU outperforms the corresponding routine PDGETRF from ScaLAPACK by a factor of 1.24 on IBM POWER 5 and by a factor of 1.31 on Cray XT4.
Keywords
matrix decomposition; parallel algorithms; Cray XT4 system; Gaussian elimination; IBM POWER 5 system; PDGETF2; ScaLAPACK; block-column factorization; communication avoiding algorithm; dense matrices factorization; latency time; two-dimensional cyclic layout; Algorithms; Delay;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
Conference_Location
Austin, TX
Print_ISBN
978-1-4244-2834-2
Electronic_ISBN
978-1-4244-2835-9
Type
conf
DOI
10.1109/SC.2008.5214287
Filename
5214287
Link To Document