• DocumentCode
    1492794
  • Title

    Improved Architectures for a Fused Floating-Point Add-Subtract Unit

  • Author

    Sohn, Jongwook ; Swartzlander, Earl E., Jr.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
  • Volume
    59
  • Issue
    10
  • fYear
    2012
  • Firstpage
    2285
  • Lastpage
    2291
  • Abstract
    This paper presents improved architectures for a fused floating-point add-subtract unit. The fused floating-point add-subtract unit is useful for digital signal processing (DSP) applications such as fast Fourier transform (FFT) and discrete cosine transform (DCT) butterfly operations. To improve the performance of the fused floating-point add-subtract unit, a dual-path algorithm and pipelining are employed. The proposed designs are implemented for both single and double precision and synthesized with a 45-nm standard-cell library. The fused floating-point add-subtract unit saves 40% of the area and power consumption compared to a discrete floating-point add-subtract unit. The proposed dual-path design reduces the latency by 30% compared to the discrete design with area and power consumption between that of the discrete and fused designs. Based on a data flow analysis, the proposed fused dual-path floating-point add-subtract unit can be split into two pipeline stages. Since the latencies of two pipeline stages are fairly well balanced, the throughput is increased by 80% compared to the nonpipelined dual-path design.
  • Keywords
    adders; data flow analysis; digital signal processing chips; floating point arithmetic; DCT butterfly operation; DSP application; FFT; data flow analysis; digital signal processing; discrete cosine transform; discrete floating-point add-subtract unit; dual-path algorithm; fast Fourier transform; fused floating-point add-subtract unit; pipeline stages; power consumption; standard-cell library; Adders; Computer architecture; Digital signal processing; Discrete cosine transforms; Pipeline processing; Power demand; Throughput; Digital signal processing (DSP); floating-point arithmetic; fused floating-point operation; high-speed computer arithmetic;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems I: Regular Papers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1549-8328
  • Type

    jour

  • DOI
    10.1109/TCSI.2012.2188955
  • Filename
    6182723