• DocumentCode
    3092389
  • Title

    The improved BiCGStab method for large and sparse unsymmetric linear systems on parallel distributed memory architectures

  • Author

    Yang, Laurence Tianruo ; Brent, Richard P.

  • fYear
    2002
  • fDate
    23-25 Oct. 2002
  • Firstpage
    324
  • Lastpage
    328
  • Abstract
    In this paper, an improved version of the BiCGStab method (IBiCGStab) for the solution of large and sparse linear systems of equations with unsymmetric coefficient matrices is proposed. The method combines elements of numerical stability and parallel algorithm design without increasing the computational costs. The algorithm is derived such that all inner products of a single iteration step are independent and the communication time required for the inner products can be overlapped efficiently with the computation time of the vector updates. Therefore, the cost of global communication, which represents the bottleneck of the parallel performance, can be significantly reduced. The resulting IBiCGStab algorithm maintains the favorable properties of the original method while not increasing computational costs. A data distribution suitable for both irregularly and regularly structured matrices, based on an analysis of the nonzero matrix elements, is presented. The communication scheme is supported by overlapping execution of computation and communication to reduce waiting times. The efficiency of the method is demonstrated by numerical experiments carried out on a massively parallel distributed memory system.
  • Keywords
    distributed memory systems; mathematics computing; numerical stability; parallel algorithms; parallel architectures; performance evaluation; sparse matrices; BiCGStab method; bottleneck; computation time; computational costs; experimental results; large sparse unsymmetric linear systems; massively parallel distributed memory system; nonzero matrix elements; parallel algorithm design; unsymmetric coefficient matrices; Algorithm design and analysis; Computational efficiency; Costs; Equations; Global communication; Linear systems; Memory architecture
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Algorithms and Architectures for Parallel Processing, 2002. Proceedings. Fifth International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7695-1512-6
  • Type

    conf

  • DOI
    10.1109/ICAPP.2002.1173595
  • Filename
    1173595