DocumentCode
2945215
Title
Deplump for Streaming Data
Author
Bartlett, Nicholas ; Wood, Frank
Author_Institution
Dept. of Stat., Columbia Univ., New York, NY, USA
fYear
2011
fDate
29-31 March 2011
Firstpage
363
Lastpage
372
Abstract
We present a general-purpose, loss less compressor for streaming data. This compressor is based on the deplump probabilistic compressor for batch data. Approximations to the inference procedure used in the probabilistic model underpinning deplump are introduced that yield the computational asyptotics necessary for stream compression. We demonstrate the performance of this streaming deplump variant relative to the batch compressor on a benchmark corpus and find that it performs equivalently well despite these approximations. We also explore the performance of the streaming variant on corpora that are too large to be compressed by batch deplump and demonstrate excellent compression performance.
Keywords
data compression; probability; batch compressor; batch data; deplump probabilistic compressor; inference procedure; probabilistic model underpinning deplump; stream compression; streaming data loss less compressor; streaming deplump variant relative; Approximation algorithms; Approximation methods; Complexity theory; Computational modeling; Context; Inference algorithms; Vegetation; Bayesian; Non-parameteric; sequence memoizer;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference (DCC), 2011
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
978-1-61284-279-0
Type
conf
DOI
10.1109/DCC.2011.43
Filename
5749494
Link To Document