DocumentCode :
610867
Title :
Fast Reproducible Floating-Point Summation
Author :
Demmel, J. ; Hong Diep Nguyen
Author_Institution :
Math. Dept. & CS Div., Univ. of California at Berkeley, Berkeley, CA, USA
fYear :
2013
fDate :
7-10 April 2013
Firstpage :
163
Lastpage :
172
Abstract :
Reproducibility, i.e. getting the bitwise identical floating point results from multiple runs of the same program, is a property that many users depend on either for debugging or correctness checking in many codes [1]. However, the combination of dynamic scheduling of parallel computing resources, and floating point nonassociativity, make attaining reproducibility a challenge even for simple reduction operations like computing the sum of a vector of numbers in parallel. We propose a technique for floating point summation that is reproducible independent of the order of summation. Our technique uses Rump´s algorithm for error-free vector transformation [2], and is much more efficient than using (possibly very) high precision arithmetic. Our algorithm trades off efficiency and accuracy: we reproducibly attain reasonably accurate results (with an absolute error bound c · n2 · macheps · max |vi| for a small constant c) with just 2n + O(1) floating-point operations, and quite accurate results (with an absolute error bound c · n3 · macheps2 · max |vi| with 5n + O(1) floating point operations, both with just two reduction operations. Higher accuracies are also possible by increasing the number of error-free transformations. As long as the same rounding mode is used, results computed by the proposed algorithms are reproducible for any run on any platform.
Keywords :
floating point arithmetic; parallel processing; scheduling; Rump algorithm; correctness check; debugging; dynamic scheduling; error-free vector transformation; floating point nonassociativity; floating-point operation; floating-point summation; parallel computing resource; reduction operation; reproducibility property; rounding mode; summation order; Accuracy; Algorithm design and analysis; Educational institutions; Numerical analysis; Parallel processing; Program processors; Vectors; floating-point; parallelism; reproducibility; rounding error; summation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Arithmetic (ARITH), 2013 21st IEEE Symposium on
Conference_Location :
Austin, TX
ISSN :
1063-6889
Print_ISBN :
978-1-4673-5644-2
Type :
conf
DOI :
10.1109/ARITH.2013.9
Filename :
6545904
Link To Document :
بازگشت