Title :
Hardware/Software Co-Design for Matrix Computations on Reconfigurable Computing Systems
Author :
Zhuo, Ling ; Prasanna, Viktor K.
Author_Institution :
Dept. of Comput. Sci., Univ. of Southern California, CA
Abstract :
Recently, reconfigurable computing systems have been built which employ field-programmable gate arrays (FPGAs) as hardware accelerators for general-purpose processors. These systems provide new opportunities for scientific computations. However, the co-existence of the processors and the FPGAs in such systems also poses new challenges to application developers. In this paper, we investigate a design model for hybrid designs, that is, designs that utilize both the processors and the FPGAs. The model characterizes a reconfigurable computing system using various system parameters, including the floating-point computing power of the processor and the FPGA, the number of nodes, the memory bandwidth and the network bandwidth. Using the model, we investigate hardware/software co-design for two computationally intensive applications: matrix factorization and all-pairs shortest-paths problem. Our designs balance the load between the processor and the FPGA, as well as overlap the computation time with memory transfer time and network communication time. The proposed designs are implemented on 6 nodes in a Cray XD1 chassis. Our implementations achieve 20 GFLOPS and 6.6 GFLOPS for these two applications, respectively.
Keywords :
field programmable gate arrays; hardware-software codesign; matrix decomposition; reconfigurable architectures; all-pairs shortest-paths problem; field-programmable gate arrays; floating-point computing power; hardware accelerator; hardware-software co-design; matrix factorisation; memory bandwidth; memory transfer time; network bandwidth; network communication time; reconfigurable computing systems; Application software; Bandwidth; Computer networks; Concurrent computing; Field programmable gate arrays; Hardware; Parallel processing; Power system modeling; Process design; Software performance;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location :
Long Beach, CA
Print_ISBN :
1-4244-0910-1
Electronic_ISBN :
1-4244-0910-1
DOI :
10.1109/IPDPS.2007.370268