DocumentCode
1063746
Title
High-speed double-precision computation of reciprocal, division, square root, and inverse square root
Author
Pineiro, Jose-Alejandro ; Bruguera, Javier Diaz
Author_Institution
Dept. de Electron. y Comput., Santiago de Compostela Univ., Spain
Volume
51
Issue
12
fYear
2002
fDate
12/1/2002 12:00:00 AM
Firstpage
1377
Lastpage
1388
Abstract
A new method for the high-speed computation of double-precision floating-point reciprocal, division, square root, and inverse square root operations is presented in this paper. This method employs a second-degree minimax polynomial approximation to obtain an accurate initial estimate of the reciprocal and the inverse square root values, and then performs a modified Goldschmidt iteration. The high accuracy of the initial approximation allows us to obtain double-precision results by computing a single Goldschmidt iteration, significantly reducing the latency of the algorithm. Two unfolded architectures are proposed: the first one computing only reciprocal and division operations, and the second one also including the computation of square root and inverse square root. The execution times and area costs for both architectures are estimated, and a comparison with other multiplicative-based methods is presented. The results of this comparison show the achievement of a lower latency than these methods, with similar hardware requirements.
Keywords
floating point arithmetic; iterative methods; minimax techniques; polynomial approximation; accurate initial estimate; area costs; double-precision floating-point division operations; double-precision floating-point inverse square root operations; double-precision floating-point reciprocal operations; double-precision floating-point square root operations; execution times; hardware requirements; high-speed double-precision computation; latency; modified Goldschmidt iteration; multiplicative-based methods; second-degree minimax polynomial approximation; unfolded architectures; Approximation algorithms; Computer architecture; Convergence; Delay; Hardware; High performance computing; Iterative algorithms; Minimax techniques; Polynomials; Signal processing algorithms;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2002.1146704
Filename
1146704
Link To Document