DocumentCode :
3686768
Title :
Reproducible floating-point atomic addition in data-parallel environment
Author :
David Defour;Sylvain Collange
Author_Institution :
Laboratoire DALI-LIRMM, 52 avenue Paul Alduy, 66860 Perpignan Cerdex - France
fYear :
2015
Firstpage :
721
Lastpage :
728
Abstract :
Floating-point additions in concurrent execution environment are known to be hazardous, as the result depends on the order in which operations are performed. This problem is encountered in data parallel execution environments such as GPUs, where reproducibility involving floating-point atomic addition is challenging. This problem is due to the rounding error or cancellation that appears for each operation, combined with the lack of control over execution order. In this article we propose two solutions to address this problem: work reassignment and fixed-point accumulation. Work reassignment consists in enforcing an execution order that leads to weak reproducibility. Fixed-point accumulation consists in avoiding rounding errors altogether thanks to a long accumulator and enables strong reproducibility.
Keywords :
"Graphics processing units","Indexes","Hardware","Kernel","Synchronization","Message systems","Instruction sets"
Publisher :
ieee
Conference_Titel :
Computer Science and Information Systems (FedCSIS), 2015 Federated Conference on
Type :
conf
DOI :
10.15439/2015F86
Filename :
7321514
Link To Document :
بازگشت