• DocumentCode
    1497505
  • Title

    Parallel matrix multiplication on a linear array with a reconfigurable pipelined bus system

  • Author

    Li, Keqin ; Pan, Victor Y.

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York, New Paltz, NY, USA
  • Volume
    50
  • Issue
    5
  • fYear
    2001
  • fDate
    5/1/2001 12:00:00 AM
  • Firstpage
    519
  • Lastpage
    525
  • Abstract
    The known fast sequential algorithms for multiplying two N×N matrices (over an arbitrary ring) have time complexity O(Nα ), where 2<α<3. The current best value of α is less than 2.3755. We show that, for all 1⩽p⩽Nα, multiplying two N×N matrices can be performed on a p-processor linear array with a reconfigurable pipelined bus system (LARPBS) in O(N m/P+(N2/p2α/)log p) time. This is currently the fastest parallelization of the best known sequential matrix multiplication algorithm on a distributed memory parallel system. In particular, for all 1⩽p⩽N2.3755, multiplying two N×N matrices can be performed on a p-processor LARPBS in O(N2.3755/p+(N2)/p0.8419log p) time and linear speedup can be achieved for p as large as O(N2.3755/(log N)6.3262). Furthermore, multiplying two N×N matrices can be performed on an LARPBS with O(Nα) processors in O(log N) time. This compares favorably with the performance on a PRAM
  • Keywords
    computational complexity; matrix multiplication; parallel algorithms; LARPBS; matrix multiplication; parallelization; reconfigurable pipelined bus system; time complexity; Arithmetic; Hypercubes; Optical arrays; Parallel algorithms; Phase change random access memory; Standards;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/12.926164
  • Filename
    926164