DocumentCode :
2343864
Title :
Special-purposed VLIW architecture for IEEE-754 quadruple precision elementary functions on FPGA
Author :
Lei, Yuanwu ; Dou, Yang ; Shen, Li ; Zhou, Jie ; Guo, Song
Author_Institution :
Nat. Lab. for Parallel&Distrib. Process., Nat. Univ. of Defense Technol., Changsha, China
fYear :
2011
fDate :
9-12 Oct. 2011
Firstpage :
219
Lastpage :
225
Abstract :
This work explores the feasibility to implement IEEE-754-2008 standard quadruple precision (Quad) elementary functions on recent FPGAs with plenty of embedded memories and DSP blocks. First, we analysis the implementation algorithm of Quad elementary functions in detail. Then, we present a special-purpose Very Large Instruction Word (VLIW) architecture for Quad elementary function (QE-Processor). The proposed processor uses a unified hardware structure, equipped with multiple basic arithmetic units, to implement various Quad algebraic and transcendental functions, in which several tradeoffs between latency and resource usage are carefully planned to avoid unbalanced resource utilization. The performance is improved through the explicitly parallel technology of custom VLIW instruction. Finally, we create a prototype of QE-Processor into Xilinx Virtex-5 and Virtex-6 FPGA chips. The experimental results show that our design can guarantee that the percentage of correct rounding is more than 99.9%. Moreover, the FPGA implementation on Virtex-6 XC6VLX760-2FF1760 FPGA, running at 220 MHz, outperforms the parallel software approach based on OpenMP running on an Intel Xeon E5620 CPU at 2.40GHz by a factor of 13X-20X for special function applications in Boost library.
Keywords :
IEEE standards; digital signal processing chips; field programmable gate arrays; logic design; multiprocessing systems; IEEE-754-2008 standard; Intel Xeon E5620 CPU; OpenMP; QE-Processor; Quad algebraic function; VLIW architecture; Virtex-6 FPGA chip; Xilinx Virtex-5; frequency 2.40 GHz; frequency 220 MHz; parallel software; quadruple precision elementary function; transcendental function; very large instruction word; Artificial intelligence; Lead; Random access memory; Strontium; System-on-a-chip;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Design (ICCD), 2011 IEEE 29th International Conference on
Conference_Location :
Amherst, MA
ISSN :
1063-6404
Print_ISBN :
978-1-4577-1953-0
Type :
conf
DOI :
10.1109/ICCD.2011.6081400
Filename :
6081400
Link To Document :
بازگشت