DocumentCode
3049464
Title
Space-time tradeoffs in the inverse B-W transform
Author
Seward, Julian
Author_Institution
Microsoft Res., Cambridge, UK
fYear
2001
fDate
2001
Firstpage
439
Lastpage
448
Abstract
Lossless text compression based on the Burrows-Wheeler transform (BWT) has become popular. Compression-time issues-MTF coding or the avoidance thereof (Wirth 2000), encoding the MTF values, sorting fast (Seward 2000)-have seen considerable investigation. Decompression-time issues remain underinvestigated. For some applications, decompression-time behaviour is a critical issue. For example, boot-time decompression of the Linux kernel requires decompression in limited memory. On-the-fly disk compressors employing the BWT demand fast decompression. This paper presents a number of ways to implement the inverse B-W transform, giving a useful range of speed-memory tradeoffs. We show that, at one extreme, it is possible to invert the BWT, slowly, using just one byte per block-byte. At the other extreme, we argue that the maximum speed of BWT decompression is unavoidably constrained by the rate at which cache misses are serviced
Keywords
data compression; inverse problems; text analysis; transforms; Burrows-Wheeler transform; cache misses; decompression-time; inverse B-W transform; lossless text compression; space-time tradeoffs; speed-memory tradeoffs; Compressors; Concatenated codes; Disk recording; Encoding; Fasteners; Frequency synthesizers; Kernel; Linux; Sorting;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference, 2001. Proceedings. DCC 2001.
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
0-7695-1031-0
Type
conf
DOI
10.1109/DCC.2001.917175
Filename
917175
Link To Document