• DocumentCode
    147080
  • Title

    Relative Lempel-Ziv with Constant-Time Random Access

  • Author

    Gagie, Travis ; Puglisi, Simon J.

  • Author_Institution
    Univ. of Helsinki, Helsinki, Finland
  • fYear
    2014
  • fDate
    26-28 March 2014
  • Firstpage
    405
  • Lastpage
    405
  • Abstract
    Relative Lempel-Ziv [1] (RLZ) is a variant of LZ77 that can compress collections of similar genomes well, while still allowing fast random access to them. We implemented RLZ using compressed bit vectors to support constant-time random access at the cost of sublinear extra space. We compared our implementation of RLZ to Deorowicz and Grabowski´s GDC [11] scheme and achieved comparable compression and much smaller access times for short substrings.
  • Keywords
    bioinformatics; data compression; genomics; LZ77; RLZ; compressed bit vectors; constant time random access; genomes collection compression; relative Lempel-Ziv; sublinear extra space; Bioinformatics; Data compression; Educational institutions; Genomics; Information retrieval; Libraries; Robustness; random access; relative Lempel-Ziv;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference (DCC), 2014
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Type

    conf

  • DOI
    10.1109/DCC.2014.42
  • Filename
    6824457