DocumentCode
1954895
Title
Parallel lossless data compression on the GPU
Author
Patel, Ritesh A. ; Zhang, Yao ; Mak, Jason ; Davidson, Andrew ; Owens, John D.
Author_Institution
Univ. of California, Davis, Davis, CA, USA
fYear
2012
fDate
13-14 May 2012
Firstpage
1
Lastpage
9
Abstract
We present parallel algorithms and implementations of a bzip2-like lossless data compression scheme for GPU architectures. Our approach parallelizes three main stages in the bzip2 compression pipeline: Burrows-Wheeler transform (BWT), move-to-front transform (MTF), and Huffman coding. In particular, we utilize a two-level hierarchical sort for BWT, design a novel scan-based parallel MTF algorithm, and implement a parallel reduction scheme to build the Huffman tree. For each algorithm, we perform detailed performance analysis, discuss its strengths and weaknesses, and suggest future directions for improvements. Overall, our GPU implementation is dominated by BWT performance and is 2.78× slower than bzip2, with BWT and MTF-Huffman respectively 2.89× and 1.34× slower on average.
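For orientation, the three stages named in the abstract can be sketched sequentially as below. This is a minimal CPU reference under stated assumptions, not the paper's GPU algorithms: the BWT sorts rotations naively instead of using the two-level hierarchical sort, the MTF is the ordinary serial form rather than the scan-based parallel one, and the Huffman step uses a heap instead of a parallel reduction. The function names (bwt, mtf, huffman_code_lengths) are hypothetical and chosen only for this illustration.

```python
import heapq
from collections import Counter


def bwt(data: bytes) -> tuple[bytes, int]:
    """Naive Burrows-Wheeler transform: sort all rotations of the input.

    Returns the last column of the sorted rotation matrix and the row
    index at which the original string appears.
    """
    n = len(data)
    if n == 0:
        return b"", 0
    doubled = data + data  # rotation i is doubled[i:i + n]
    order = sorted(range(n), key=lambda i: doubled[i:i + n])
    last_column = bytes(doubled[i + n - 1] for i in order)
    return last_column, order.index(0)


def mtf(data: bytes) -> list[int]:
    """Serial move-to-front transform over the 256-symbol byte alphabet."""
    symbols = list(range(256))
    out = []
    for b in data:
        idx = symbols.index(b)  # current position of the symbol
        out.append(idx)
        symbols.pop(idx)        # move the symbol to the front
        symbols.insert(0, b)
    return out


def huffman_code_lengths(symbols: list[int]) -> dict[int, int]:
    """Huffman code length per symbol, built with a heap (sequential)."""
    freq = Counter(symbols)
    if len(freq) == 1:
        return {next(iter(freq)): 1}
    # Heap entries: (weight, tie-breaker, symbols in this subtree).
    heap = [(w, i, [s]) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    lengths = dict.fromkeys(freq, 0)
    tie = len(heap)
    while len(heap) > 1:
        w1, _, s1 = heapq.heappop(heap)
        w2, _, s2 = heapq.heappop(heap)
        for s in s1 + s2:       # every merged symbol gains one code bit
            lengths[s] += 1
        heapq.heappush(heap, (w1 + w2, tie, s1 + s2))
        tie += 1
    return lengths


if __name__ == "__main__":
    transformed, row = bwt(b"banana")
    ranks = mtf(transformed)
    print(transformed, row, ranks, huffman_code_lengths(ranks))
```

The BWT groups equal symbols into runs, MTF turns those runs into small integers (mostly zeros), and the skewed distribution that results is what makes the final Huffman stage effective.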
Keywords
Huffman codes; data compression; graphics processing units; parallel algorithms; telecommunication computing; trees (mathematics); BWT; Burrows-Wheeler transform; GPU architectures; Huffman coding; Huffman tree; MTF; bzip2-like lossless data compression scheme; move-to-front transform; parallel lossless data compression; parallel reduction scheme; scan-based parallel MTF algorithm; Arrays; Encoding; Graphics processing unit; Histograms; Humans; Instruction sets; Transforms
fLanguage
English
Publisher
IEEE
Conference_Titel
Innovative Parallel Computing (InPar), 2012
Conference_Location
San Jose, CA
Print_ISBN
978-1-4673-2632-2
Electronic_ISBN
978-1-4673-2631-5
Type
conf
DOI
10.1109/InPar.2012.6339599
Filename
6339599