• DocumentCode
    3210012
  • Title

    High Performance and Parallel Model for LU Decomposition on FPGAs

  • Author

    Shao, Yi ; Jiang, Liehui ; Zhao, Qiuxia ; Wang, Yuliang

  • Author_Institution
    Technol. R&D Center, China Nat. Digital Switching Syst. Eng., Zheng Zhou, China
  • fYear
    2009
  • fDate
    17-19 Dec. 2009
  • Firstpage
    75
  • Lastpage
    79
  • Abstract
    The matrix LU decomposition is the key kernel of the Linpack benchmark. Based on the analysis of the theory Gaussian elimination, this paper proposes a parallel model of implementing the LU decomposition on FPGAs. The model has the pivoting operation and different architecture with former designs. Experimental results show the floating-point performance of our designs achieves 3371.40 MFLOPS at 110 MHz clock frequency, which is 4 times greater than the microprocessor while operating at a clock frequency that is 27 times lower.
  • Keywords
    field programmable gate arrays; matrix algebra; FPGA; Gaussian elimination; Linpack benchmark; frequency 110 MHz; matrix LU decomposition; Clocks; Computer science; Equations; Field programmable gate arrays; Frequency; Kernel; Matrices; Matrix decomposition; Switching systems; Vectors; Gaussian Elimination; LU decomposition; field programmable gate array (FPGA); parallel; pivoting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontier of Computer Science and Technology, 2009. FCST '09. Fourth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3932-4
  • Electronic_ISBN
    978-1-4244-5467-9
  • Type

    conf

  • DOI
    10.1109/FCST.2009.66
  • Filename
    5392935