• DocumentCode
    610038
  • Title

    Practical Parallel Lempel-Ziv Factorization

  • Author

    Julian Shun ; Fuyao Zhao

  • fYear
    2013
  • fDate
    20-22 March 2013
  • Firstpage
    123
  • Lastpage
    132
  • Abstract
    In the age of big data, the need for efficient data compression algorithms has grown. A widely used data compression method is the Lempel-Ziv-77 (LZ77) method, being a subroutine in popular compression packages such as gzip and PKZIP. There has been a lot of recent effort on developing practical sequential algorithms for Lempel-Ziv factorization (equivalent to LZ77 compression), but research in practical parallel implementations has been less satisfactory. In this work, we present a simple work-efficient parallel algorithm for Lempel-Ziv factorization. We show theoretically that our algorithm requires linear work and runs in O(log^2 n) time (randomized) for constant alphabets and O(n^ϵ) time (ϵ < 1) for integer alphabets. We present experimental results showing that our algorithm is efficient and achieves good speedup with respect to the best sequential implementations of Lempel-Ziv factorization.
  • Keywords
    computational complexity; data compression; parallel algorithms; randomised algorithms; LZ77 method; Lempel-Ziv-77 method; O(log^2 n) time; O(n^ϵ) time; PKZIP; data compression algorithm; integer alphabet; parallel Lempel-Ziv factorization; sequential algorithm; Algorithm design and analysis; Data compression; Dictionaries; Indexes; Parallel algorithms; Phase change random access memory; Program processors; Lempel-Ziv
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Compression Conference (DCC), 2013
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    978-1-4673-6037-1
  • Type

    conf

  • DOI
    10.1109/DCC.2013.20
  • Filename
    6543048