Title :
Improved word-aligned binary compression for text indexing
Author :
Anh, Vo Ngoc ; Moffat, Alistair
Author_Institution :
Dept. of Comput. Sci. & Software Eng., Melbourne Univ., Vic.
fDate :
6/1/2006 12:00:00 AM
Abstract :
We present an improved compression mechanism for handling the compressed inverted indexes used in text retrieval systems, extending the word-aligned binary coding carry method. Experiments using two typical document collections show that the new method obtains superior compression to previous static codes, without penalty in terms of execution speed
Keywords :
binary codes; data compression; indexing; information retrieval; text analysis; compressed inverted indexes; improved word-aligned binary compression; text indexing; text retrieval system; word-aligned binary coding carry method; Binary codes; Compaction; Databases; Decoding; Frequency; Indexing; Information retrieval; Probability distribution; Data compaction and compression; Web searching.; binary code; compression; file organization; indexing methods; inverted index; text retrieval system; text searching; textual databases;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2006.99