• DocumentCode
    922239
  • Title

    Improved word-aligned binary compression for text indexing

  • Author

    Anh, Vo Ngoc ; Moffat, Alistair

  • Author_Institution
    Dept. of Comput. Sci. & Software Eng., Melbourne Univ., Vic.
  • Volume
    18
  • Issue
    6
  • fYear
    2006
  • fDate
    6/1/2006 12:00:00 AM
  • Firstpage
    857
  • Lastpage
    861
  • Abstract
    We present an improved compression mechanism for handling the compressed inverted indexes used in text retrieval systems, extending the word-aligned binary coding carry method. Experiments using two typical document collections show that the new method obtains superior compression to previous static codes, without penalty in terms of execution speed
  • Keywords
    binary codes; data compression; indexing; information retrieval; text analysis; compressed inverted indexes; improved word-aligned binary compression; text indexing; text retrieval system; word-aligned binary coding carry method; Binary codes; Compaction; Databases; Decoding; Frequency; Indexing; Information retrieval; Probability distribution; Data compaction and compression; Web searching.; binary code; compression; file organization; indexing methods; inverted index; text retrieval system; text searching; textual databases;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2006.99
  • Filename
    1626238