DocumentCode :
3575251
Title :
Content sharing in information storage and retrieval system using tree representation of documents
Author :
Sharma, Dharmendra ; Jain, Suresh
Author_Institution :
Dept. of Comput. Eng., Mewar Univ., Chittorgarh, India
fYear :
2014
Firstpage :
1
Lastpage :
4
Abstract :
In general, document collections contain groups of documents with overlapping content. However, most information storage and retrieval systems process each document separately, causing shared content to be indexed multiple times. In this paper, we describe a new document representation model in which related documents are organized as a tree, allowing shared content to be indexed just once. We show how this representation model can reduce the size of an inverted index as well as the time needed to build it.
Keywords :
document handling; information retrieval; information storage; trees (mathematics); content sharing; document collection system; document representation model; information retrieval system; information storage; inverted index; tree representation; Indexes; Optimized production technology; Payloads; Standards; Uniform resource locators; Free text queries; Inverted index; term weight;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
IT in Business, Industry and Government (CSIBIG), 2014 Conference on
Print_ISBN :
978-1-4799-3063-0
Type :
conf
DOI :
10.1109/CSIBIG.2014.7056941
Filename :
7056941