DocumentCode
922239
Title
Improved word-aligned binary compression for text indexing
Author
Anh, Vo Ngoc ; Moffat, Alistair
Author_Institution
Dept. of Comput. Sci. & Software Eng., Melbourne Univ., Vic.
Volume
18
Issue
6
fYear
2006
fDate
6/1/2006 12:00:00 AM
Firstpage
857
Lastpage
861
Abstract
We present an improved compression mechanism for handling the compressed inverted indexes used in text retrieval systems, extending the word-aligned binary coding carry method. Experiments using two typical document collections show that the new method obtains superior compression to previous static codes, without penalty in terms of execution speed.
Keywords
binary codes; data compression; indexing; information retrieval; text analysis; compressed inverted indexes; improved word-aligned binary compression; text indexing; text retrieval system; word-aligned binary coding carry method; Binary codes; Compaction; Databases; Decoding; Frequency; Indexing; Information retrieval; Probability distribution; Data compaction and compression; Web searching; binary code; compression; file organization; indexing methods; inverted index; text retrieval system; text searching; textual databases
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
IEEE
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2006.99
Filename
1626238