• DocumentCode
    1295928
  • Title

    An I/O Bandwidth-Sensitive Sparse Matrix-Vector Multiplication Engine on FPGAs

  • Author

    Sun, Song ; Monga, Madhu ; Jones, Phillip H. ; Zambreno, Joseph

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Iowa State Univ., Ames, IA, USA
  • Volume
    59
  • Issue
    1
  • fYear
    2012
  • Firstpage
    113
  • Lastpage
    123
  • Abstract
    Sparse matrix-vector multiplication (SMVM) is a fundamental core of many high-performance computing applications, including information retrieval, medical imaging, and economic modeling. While the use of reconfigurable computing technology in a high-performance computing environment has shown recent promise in accelerating a wide variety of scientific applications, existing SMVM architectures on FPGA hardware have been limited in that they require either numerous pipeline stalls during computation (due to zero padding) or excessive input preprocessing during run-time. For large-scale sparse matrix scenarios, both of these shortcomings can result in unacceptable performance overheads, limiting the overall value of using FPGAs in a high-performance computing environment. In this paper, we present a scalable and efficient FPGA-based SMVM architecture which can handle arbitrary matrix sparsity patterns without excessive preprocessing or zero padding and can be dynamically expanded based on the available I/O bandwidth. Our experimental results using a commercial FPGA-based acceleration system demonstrate that our reconfigurable SMVM engine is highly efficient, with benchmark-dependent speedups over an optimized software implementation that range from to in terms of computation time.
  • Keywords
    field programmable gate arrays; matrix multiplication; sparse matrices; vectors; FPGA-based SMVM architecture; FPGA-based acceleration system; I/O bandwidth-sensitive sparse matrix-vector multiplication engine; benchmark-dependent speedups; economic modeling; high-performance computing environment; information retrieval; matrix sparsity patterns; medical imaging; reconfigurable computing technology; zero padding; Adders; Clocks; Computer architecture; Engines; Field programmable gate arrays; Registers; Sparse matrices; FPGA; reconfigurable computing; sparse matrix-vector multiplication;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems I: Regular Papers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1549-8328
  • Type

    jour

  • DOI
    10.1109/TCSI.2011.2161389
  • Filename
    5982109