• DocumentCode
    1063746
  • Title

    High-speed double-precision computation of reciprocal, division, square root, and inverse square root

  • Author

    Pineiro, Jose-Alejandro ; Bruguera, Javier Diaz

  • Author_Institution
    Dept. de Electron. y Comput., Santiago de Compostela Univ., Spain
  • Volume
    51
  • Issue
    12
  • fYear
    2002
  • fDate
    12/1/2002 12:00:00 AM
  • Firstpage
    1377
  • Lastpage
    1388
  • Abstract
    A new method for the high-speed computation of double-precision floating-point reciprocal, division, square root, and inverse square root operations is presented in this paper. This method employs a second-degree minimax polynomial approximation to obtain an accurate initial estimate of the reciprocal and the inverse square root values, and then performs a modified Goldschmidt iteration. The high accuracy of the initial approximation allows us to obtain double-precision results by computing a single Goldschmidt iteration, significantly reducing the latency of the algorithm. Two unfolded architectures are proposed: the first one computing only reciprocal and division operations, and the second one also including the computation of square root and inverse square root. The execution times and area costs for both architectures are estimated, and a comparison with other multiplicative-based methods is presented. The results of this comparison show the achievement of a lower latency than these methods, with similar hardware requirements.
  • Keywords
    floating point arithmetic; iterative methods; minimax techniques; polynomial approximation; accurate initial estimate; area costs; double-precision floating-point division operations; double-precision floating-point inverse square root operations; double-precision floating-point reciprocal operations; double-precision floating-point square root operations; execution times; hardware requirements; high-speed double-precision computation; latency; modified Goldschmidt iteration; multiplicative-based methods; second-degree minimax polynomial approximation; unfolded architectures; Approximation algorithms; Computer architecture; Convergence; Delay; Hardware; High performance computing; Iterative algorithms; Minimax techniques; Polynomials; Signal processing algorithms;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2002.1146704
  • Filename
    1146704