DocumentCode
147045
Title
Hybrid Compression of Bitvectors for the FM-Index
Author
Karkkainen, J. ; Kempa, Dominik ; Puglisi, Simon J.
Author_Institution
Dept. of Comput. Sci., Univ. of Helsinki, Helsinki, Finland
fYear
2014
fDate
26-28 March 2014
Firstpage
302
Lastpage
311
Abstract
Compressed bit vectors supporting rank and select operations are the workhorse of compressed data structures. We propose a hybrid scheme for implementing compressed bit vectors, which divides the bit vector into blocks and then chooses the encoding of each block separately from a number of different encoding methods. Hybrid encoding is particularly suitable for bit vectors that have lots of local and regional variation, such as those present in the FM-index, a popular compressed data structure for pattern matching. We propose a specific hybrid combination of three simple encoding methods for FM-index bit vectors achieving superior space-time tradeoffs in experiments.
Keywords
data compression; data structures; pattern matching; FM-index; bitvectors hybrid compression; compressed bit vectors; compressed data structure; encoding methods; hybrid encoding; pattern matching; Arrays; Bioinformatics; Encoding; Entropy; Indexes; Pattern matching; Compressed bitvectors; FM-index; Succinct data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference (DCC), 2014
Conference_Location
Snowbird, UT
ISSN
1068-0314
Type
conf
DOI
10.1109/DCC.2014.87
Filename
6824438
Link To Document