• DocumentCode
    2719977
  • Title

    Parallel Lossless Data Compression Based on the Burrows-Wheeler Transform

  • Author

    Gilchrist, Jeff ; Cuhadar, Aysegul

  • Author_Institution
    Dept. Syst. & Comput. Eng., Carleton Univ., Ottawa, ON
  • fYear
    2007
  • fDate
    21-23 May 2007
  • Firstpage
    877
  • Lastpage
    884
  • Abstract
    In this paper, we present parallel algorithms for lossless data compression based on the Burrows-Wheeler transform (BWT) block-sorting technique. We investigate the performance of using data parallelism and task parallelism for both multi-threaded and message-passing programming. The output produced by the parallel algorithms is fully compatible with their sequential counterparts. To balance the workload among processors we develop a task scheduling strategy. An extensive set of experiments is performed with a shared memory NUMA system using up to 120 processors and on a distributed memory cluster using up to 100 processors. Our experimental results show that significant speedup can be achieved with both data parallel and task parallel methodologies. These algorithms will greatly reduce the amount of time it takes to compress large amounts of data while the compressed data remains in a form that users without access to multiple processor systems can still use.
  • Keywords
    data compression; message passing; multi-threading; transforms; Burrows-Wheeler transform; block-sorting technique; data parallelism; message-passing programming; multi-threaded programming; parallel lossless data compression; shared memory NUMA system; task parallelism; Arithmetic; Clustering algorithms; Compression algorithms; Data compression; Image coding; Image reconstruction; Parallel algorithms; Parallel processing; Sorting; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications, 2007. AINA '07. 21st International Conference on
  • Conference_Location
    Niagara Falls, ON
  • ISSN
    1550-445X
  • Print_ISBN
    0-7695-2846-5
  • Type

    conf

  • DOI
    10.1109/AINA.2007.109
  • Filename
    4220984