• DocumentCode
    3049464
  • Title

    Space-time tradeoffs in the inverse B-W transform

  • Author

    Seward, Julian

  • Author_Institution
    Microsoft Res., Cambridge, UK
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    439
  • Lastpage
    448
  • Abstract
    Lossless text compression based on the Burrows-Wheeler transform (BWT) has become popular. Compression-time issues-MTF coding or the avoidance thereof (Wirth 2000), encoding the MTF values, sorting fast (Seward 2000)-have seen considerable investigation. Decompression-time issues remain underinvestigated. For some applications, decompression-time behaviour is a critical issue. For example, boot-time decompression of the Linux kernel requires decompression in limited memory. On-the-fly disk compressors employing the BWT demand fast decompression. This paper presents a number of ways to implement the inverse B-W transform, giving a useful range of speed-memory tradeoffs. We show that, at one extreme, it is possible to invert the BWT, slowly, using just one byte per block-byte. At the other extreme, we argue that the maximum speed of BWT decompression is unavoidably constrained by the rate at which cache misses are serviced
  • Keywords
    data compression; inverse problems; text analysis; transforms; Burrows-Wheeler transform; cache misses; decompression-time; inverse B-W transform; lossless text compression; space-time tradeoffs; speed-memory tradeoffs; Compressors; Concatenated codes; Disk recording; Encoding; Fasteners; Frequency synthesizers; Kernel; Linux; Sorting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference, 2001. Proceedings. DCC 2001.
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    0-7695-1031-0
  • Type

    conf

  • DOI
    10.1109/DCC.2001.917175
  • Filename
    917175