• DocumentCode
    562583
  • Title

    Inverted index compression using Extended Golomb Code

  • Author

    Glory, V. ; Domnic, S.

  • Author_Institution
    Dept. of Comput. Applic., Nat. Inst. of Technol., Tiruchirappalli, India
  • fYear
    2012
  • fDate
    30-31 March 2012
  • Firstpage
    20
  • Lastpage
    25
  • Abstract
    Web Search Engines use inverted index structures for efficient query processing. But the size of the inverted index is extremely large due to rapid growth in the size of the text data in the web. In order to reduce the index size and increase the accessing speed, compression techniques are used. In this paper, we make use of a new integer compression technique, Extended Golomb Code (EGC), to reduce the size of the inverted index. We have tested the performance of EGC with other existing techniques. Experimental results show that EGC performs better than other existing techniques in compressing inverted index.
  • Keywords
    Internet; data compression; indexing; query processing; search engines; text analysis; EGC; Web search engine; accessing speed; extended Golomb code; index size; information retrieval system; integer compression technique; inverted index compression; inverted index structure; query processing; text data size; Indexes; D-Gap; Information Retrieval System; Inverted File; Inverted Index Compression; Search Engines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Engineering, Science and Management (ICAESM), 2012 International Conference on
  • Conference_Location
    Nagapattinam, Tamil Nadu
  • Print_ISBN
    978-1-4673-0213-5
  • Type

    conf

  • Filename
    6215567