• DocumentCode
    3111131
  • Title

    Communication Avoiding Gaussian elimination

  • Author

    Grigori, Laura ; Demmel, James ; Xiang, Hia

  • Author_Institution
    INRIA Saclay-Ile de France, Univ. Paris-Sud 11, Orsay, France
  • fYear
    2008
  • fDate
    15-21 Nov. 2008
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    We present CALU, a Communication Avoiding algorithm for the LU factorization of dense matrices distributed in a two-dimensional cyclic layout. The algorithm is based on a new pivoting strategy, which is stable in practice. The new algorithm is optimal (up to polylogarithmic factors) in the amount of communication it performs. Our experiments show that CALU leads to a reduction in the parallel time, in particular when the latency time is an important factor of the overall time. The factorization of a block-column, a subroutine of CALU, outperforms the corresponding routine PDGETF2 from ScaLAPACK up to a factor of 4.37 on an IBM POWER 5 system and up to a factor of 5.58 on a Cray XT4 system. On square matrices of order 104, CALU outperforms the corresponding routine PDGETRF from ScaLAPACK by a factor of 1.24 on IBM POWER 5 and by a factor of 1.31 on Cray XT4.
  • Keywords
    matrix decomposition; parallel algorithms; Cray XT4 system; Gaussian elimination; IBM POWER 5 system; PDGETF2; ScaLAPACK; block-column factorization; communication avoiding algorithm; dense matrices factorization; latency time; two-dimensional cyclic layout; Algorithms; Delay;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
  • Conference_Location
    Austin, TX
  • Print_ISBN
    978-1-4244-2834-2
  • Electronic_ISBN
    978-1-4244-2835-9
  • Type

    conf

  • DOI
    10.1109/SC.2008.5214287
  • Filename
    5214287