• DocumentCode
    3053583
  • Title

    Optimistic Parallelization of Floating-Point Accumulation

  • Author

    Kapre, Nachiket ; DeHon, André

  • Author_Institution
    California Inst. of Technol., Pasadena
  • fYear
    2007
  • fDate
    25-27 June 2007
  • Firstpage
    205
  • Lastpage
    216
  • Abstract
    Floating-point arithmetic is notoriously nonassociative due to the limited precision representation which demands intermediate values be rounded to fit in the available precision. The resulting cyclic dependency in floating-point accumulation inhibits parallelization of the computation, including efficient use of pipelining. In practice, however, we observe that floating-point operations are "mostly" associative. This observation can be exploited to parallelize floating-point accumulation using a form of optimistic concurrency. In this scheme, we first compute an optimistic associative approximation to the sum and then relax the computation by iteratively propagating errors until the correct sum is obtained. We map this computation to a network of 16 statically-scheduled, pipelined, double-precision floating-point adders on the Virtex-4 LX160 (-12) device where each floating-point adder runs at 296 MHz and has a pipeline depth of 10. On this 16 PE design, we demonstrate an average speedup of 6times with randomly generated data and 3-7times with summations extracted from Conjugate Gradient benchmarks.
  • Keywords
    conjugate gradient methods; floating point arithmetic; Virtex-4 LX160 device; conjugate gradient benchmarks; cyclic dependency; floating-point accumulation; floating-point adders; floating-point arithmetic; optimistic associative approximation; optimistic parallelization; pipelining; Computer networks; Computer science; Concurrent computing; Data mining; Delay; Error correction; Floating-point arithmetic; Hardware; Pipeline processing; Systems engineering and theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Arithmetic, 2007. ARITH '07. 18th IEEE Symposium on
  • Conference_Location
    Montepellier
  • ISSN
    1063-6889
  • Print_ISBN
    0-7695-2854-6
  • Type

    conf

  • DOI
    10.1109/ARITH.2007.25
  • Filename
    4272867