High-speed double-precision computation of reciprocal, division, square root, and inverse square root

Author

Pineiro, Jose-Alejandro ; Bruguera, Javier Diaz

Author_Institution

Dept. de Electron. y Comput., Santiago de Compostela Univ., Spain

Volume

51

Issue

12

fYear

2002

fDate

12/1/2002 12:00:00 AM

Firstpage

1377

Lastpage

1388

Abstract

A new method for the high-speed computation of double-precision floating-point reciprocal, division, square root, and inverse square root operations is presented in this paper. This method employs a second-degree minimax polynomial approximation to obtain an accurate initial estimate of the reciprocal and the inverse square root values, and then performs a modified Goldschmidt iteration. The high accuracy of the initial approximation allows us to obtain double-precision results by computing a single Goldschmidt iteration, significantly reducing the latency of the algorithm. Two unfolded architectures are proposed: the first one computing only reciprocal and division operations, and the second one also including the computation of square root and inverse square root. The execution times and area costs for both architectures are estimated, and a comparison with other multiplicative-based methods is presented. The results of this comparison show the achievement of a lower latency than these methods, with similar hardware requirements.

Keywords

floating point arithmetic; iterative methods; minimax techniques; polynomial approximation; accurate initial estimate; area costs; double-precision floating-point division operations; double-precision floating-point inverse square root operations; double-precision floating-point reciprocal operations; double-precision floating-point square root operations; execution times; hardware requirements; high-speed double-precision computation; latency; modified Goldschmidt iteration; multiplicative-based methods; second-degree minimax polynomial approximation; unfolded architectures; Approximation algorithms; Computer architecture; Convergence; Delay; Hardware; High performance computing; Iterative algorithms; Minimax techniques; Polynomials; Signal processing algorithms;

fLanguage

English

Journal_Title

Computers, IEEE Transactions on

Publisher

ieee

ISSN

0018-9340

Type

jour

DOI

10.1109/TC.2002.1146704

Filename

1146704