DocumentCode
562583
Title
Inverted index compression using Extended Golomb Code
Author
Glory, V.; Domnic, S.
Author_Institution
Dept. of Comput. Applic., Nat. Inst. of Technol., Tiruchirappalli, India
fYear
2012
fDate
30-31 March 2012
Firstpage
20
Lastpage
25
Abstract
Web search engines use inverted index structures for efficient query processing. However, the inverted index becomes extremely large because of the rapid growth of text data on the web. Compression techniques are used to reduce the index size and increase access speed. In this paper, we use a new integer compression technique, Extended Golomb Code (EGC), to reduce the size of the inverted index. We compare the performance of EGC against other existing techniques. Experimental results show that EGC compresses the inverted index better than the other existing techniques.
Keywords
Internet; data compression; indexing; query processing; search engines; text analysis; EGC; Web search engine; accessing speed; extended Golomb code; index size; information retrieval system; integer compression technique; inverted index compression; inverted index structure; query processing; text data size; Indexes; D-Gap; Information Retrieval System; Inverted File; Inverted Index Compression; Search Engines;
fLanguage
English
Publisher
IEEE
Conference_Titel
Advances in Engineering, Science and Management (ICAESM), 2012 International Conference on
Conference_Location
Nagapattinam, Tamil Nadu
Print_ISBN
978-1-4673-0213-5
Type
conf
Filename
6215567