Title :
An Efficient Approach for Inverted Index Pruning Based on Document Relevance
Author :
Vishwakarma, Santosh K. ; Lakhtaria, Kamaljit I. ; Bhatnagar, Deepak ; Sharma, Arvind Kumar
Author_Institution :
Sir Padampat Singhania Univ., Udaipur, India
Abstract :
Information Retrieval deals with retrieving documents from a large collection that matches the information need of a user. Efficient retrieval is based on the proper storage of the inverted index. There have been many techniques for reducing the size of the inverted index. Static index pruning is one such technique, which is used to reduce the index size. This paper investigates a static index pruning approach which is useful to reduce the index size. The proposed approach prunes the entire document from the index based on its importance and relevance of top-k results. The elimination takes place on the basis of the score of the individual document. Experiments have been conducted on the FIRE text collection. Based on the results, it was found that for specific collections, the proposed model gives better precision values for the retrieval of top 30 and above documents.
Keywords :
document handling; indexing; relevance feedback; FIRE text collection; document relevance; document retrieval; information retrieval; inverted index pruning; inverted index size reduction; inverted index storage; static index pruning; user information needs; Conferences; Educational institutions; Fires; Frequency measurement; Indexes; Information retrieval; Document Relevance; Information Retrieval; Static Index Pruning; tf-idf weighting score;
Conference_Titel :
Communication Systems and Network Technologies (CSNT), 2014 Fourth International Conference on
Conference_Location :
Bhopal
Print_ISBN :
978-1-4799-3069-2
DOI :
10.1109/CSNT.2014.103