DocumentCode :
659442
Title :
Improving floating point compression through binary masks
Author :
Gomez, Leonardo A. Bautista ; Cappello, Franck
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
326
Lastpage :
331
Abstract :
Modern scientific technology such as particle accelerators, telescopes, and supercomputers are producing extremely large amounts of data. That scientific data needs to be processed by using systems with high computational capabilities such as supercomputers. Given that the scientific data is increasing in size at an exponential rate, storing and accessing the data are becoming expensive in both time and space. Most of this scientific data is stored by using floating point representation. Scientific applications executed on supercomputers spend a large amount of CPU cycles reading and writing floating point values, making data compression techniques an interesting way to increase computing efficiency. Given the accuracy requirements of scientific computing, we only focus on lossless data compression. In this paper we propose a masking technique that partially decreases the entropy of scientific datasets, allowing for a better compression ratio and higher throughput. We evaluate several data partitioning techniques for selective compression and compare these schemes with several existing compression strategies. Our approach shows up to 15% improvement in compression ratio while reducing the time spent in compression by half time in some cases.
Keywords :
data compression; natural sciences computing; parallel machines; storage management; CPU cycles; binary masks; computational capabilities; computing efficiency; data access; data compression techniques; data partitioning techniques; data storage; decrease scientific dataset entropy; exponential rate; floating point compression improvement; floating point representation; floating point value reading; floating point value writing; particle accelerators; scientific data; scientific technology; selective compression; supercomputers; telescopes; Data compression; Entropy; Stability analysis; Supercomputers; Thermal stability; Three-dimensional displays; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
Type :
conf
DOI :
10.1109/BigData.2013.6691591
Filename :
6691591
Link To Document :
بازگشت