Title :
Practical Parallel Lempel-Ziv Factorization
Author :
Julian Shun ; Fuyao Zhao
Abstract :
In the age of big data, the need for efficient data compression algorithms has grown. A widely used data compression method is the Lempel-Ziv-77 (LZ77) method, which is a subroutine in popular compression packages such as gzip and PKZIP. There has been a lot of recent effort on developing practical sequential algorithms for Lempel-Ziv factorization (equivalent to LZ77 compression), but research in practical parallel implementations has been less satisfactory. In this work, we present a simple work-efficient parallel algorithm for Lempel-Ziv factorization. We show theoretically that our algorithm requires linear work and runs in O(log^2 n) time (randomized) for constant alphabets and O(n^ϵ) time (ϵ < 1) for integer alphabets. We present experimental results showing that our algorithm is efficient and achieves good speedup with respect to the best sequential implementations of Lempel-Ziv factorization.
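For readers unfamiliar with the term, the following is a minimal sketch of what a Lempel-Ziv factorization computes. It is a naive quadratic-time sequential illustration of the definition only, not the parallel algorithm of the paper; the function name `lz_factorize` and the `(text, source)` output format are assumptions made for this example. Each factor is either a single character with no earlier occurrence, or the longest prefix of the remaining text that also starts at an earlier position (overlapping the current position is allowed).

```python
from typing import List, Optional, Tuple


def lz_factorize(s: str) -> List[Tuple[str, Optional[int]]]:
    """Naive LZ factorization (illustrative only, worst-case quadratic or worse).

    Returns a list of (factor_text, source) pairs, where source is the
    earlier start position the factor copies from, or None for a literal
    character that has not occurred before.
    """
    factors: List[Tuple[str, Optional[int]]] = []
    n = len(s)
    i = 0
    while i < n:
        best_len, best_src = 0, None
        # Try every earlier start position j < i as a copy source.
        for j in range(i):
            length = 0
            # The match may run past position i (self-overlap is allowed).
            while i + length < n and s[j + length] == s[i + length]:
                length += 1
            if length > best_len:
                best_len, best_src = length, j
        if best_len == 0:
            # No earlier occurrence: emit the character as a literal factor.
            factors.append((s[i], None))
            i += 1
        else:
            factors.append((s[i:i + best_len], best_src))
            i += best_len
    return factors
```

Concatenating the factor texts in order always reproduces the input string; the sources are what an LZ77-style compressor would encode in place of the copied text.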
Keywords :
computational complexity; data compression; parallel algorithms; randomised algorithms; LZ77 method; Lempel-Ziv-77 method; O(log^2 n) time; O(n^ϵ) time; PKZIP; data compression algorithm; integer alphabet; parallel Lempel-Ziv factorization; sequential algorithm; Algorithm design and analysis; Data compression; Dictionaries; Indexes; Parallel algorithms; Phase change random access memory; Program processors; Lempel-Ziv;
Conference_Title :
Data Compression Conference (DCC), 2013
Conference_Location :
Snowbird, UT
Print_ISBN :
978-1-4673-6037-1
DOI :
10.1109/DCC.2013.20