مرکز منطقه ای اطلاع رساني علوم و فناوري - Fast Reproducible Floating-Point Summation

DocumentCode :

610867

Title :

Fast Reproducible Floating-Point Summation

Author :

Demmel, J. ; Hong Diep Nguyen

Author_Institution :

Math. Dept. & CS Div., Univ. of California at Berkeley, Berkeley, CA, USA

fYear :

2013

fDate :

7-10 April 2013

Firstpage :

163

Lastpage :

172

Abstract :

Reproducibility, i.e. getting the bitwise identical floating point results from multiple runs of the same program, is a property that many users depend on either for debugging or correctness checking in many codes [1]. However, the combination of dynamic scheduling of parallel computing resources, and floating point nonassociativity, make attaining reproducibility a challenge even for simple reduction operations like computing the sum of a vector of numbers in parallel. We propose a technique for floating point summation that is reproducible independent of the order of summation. Our technique uses Rump´s algorithm for error-free vector transformation [2], and is much more efficient than using (possibly very) high precision arithmetic. Our algorithm trades off efficiency and accuracy: we reproducibly attain reasonably accurate results (with an absolute error bound c · n² · macheps · max |v_i| for a small constant c) with just 2n + O(1) floating-point operations, and quite accurate results (with an absolute error bound c · n³ · macheps² · max |v_i| with 5n + O(1) floating point operations, both with just two reduction operations. Higher accuracies are also possible by increasing the number of error-free transformations. As long as the same rounding mode is used, results computed by the proposed algorithms are reproducible for any run on any platform.

Keywords :

floating point arithmetic; parallel processing; scheduling; Rump algorithm; correctness check; debugging; dynamic scheduling; error-free vector transformation; floating point nonassociativity; floating-point operation; floating-point summation; parallel computing resource; reduction operation; reproducibility property; rounding mode; summation order; Accuracy; Algorithm design and analysis; Educational institutions; Numerical analysis; Parallel processing; Program processors; Vectors; floating-point; parallelism; reproducibility; rounding error; summation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Arithmetic (ARITH), 2013 21st IEEE Symposium on

Conference_Location :

Austin, TX

ISSN :

1063-6889

Print_ISBN :

978-1-4673-5644-2

Type :

conf

DOI :

10.1109/ARITH.2013.9

Filename :

6545904

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=610867