DocumentCode
3633237
Title
Performance comparison of single-precision SPICE Model-Evaluation on FPGA, GPU, Cell, and multi-core processors
Author
Nachiket Kapre;Andre DeHon
Author_Institution
Computer Science, California Institute of Technology, Pasadena 91125, USA
fYear
2009
Firstpage
65
Lastpage
72
Abstract
Automated code generation and performance tuning techniques for concurrent architectures such as GPUs, Cell and FPGAs can provide integer factor speedups over multi-core processor organizations for data-parallel, floating-point computation in SPICE Model-Evaluation. Our Verilog AMS compiler produces code for parallel evaluation of non-linear circuit models suitable for use in SPICE simulations where the same model is evaluated several times for all the devices in the circuit. Our compiler uses architecture specific parallelization strategies (OpenMP for multi-core, PThreads for Cell, CUDA for GPU, statically scheduled VLIW for FPGA) when producing code for these different architectures. We automatically explore different implementation configurations (e.g. unroll factor, vector length) using our performance-tuner to identify the best possible configuration for each architecture. We demonstrate speedups of 3− 182× for a Xilinx Virtex5 LX 330T, 1.3−33× for an IBM Cell, and 3−131× for an NVIDIA 9600 GT GPU over a 3 GHz Intel Xeon 5160 implementation for a variety of singleprecision device models.
Keywords
"SPICE","Field programmable gate arrays","Multicore processing","Computer architecture","Computational modeling","Circuit optimization","Concurrent computing","Hardware design languages","Circuit simulation","Scheduling"
Publisher
ieee
Conference_Titel
Field Programmable Logic and Applications, 2009. FPL 2009. International Conference on
ISSN
1946-147X
Electronic_ISBN
1946-1488
Type
conf
DOI
10.1109/FPL.2009.5272548
Filename
5272548
Link To Document