Title :
A Set-Covering-Based Approach for Overlapping Resource Selection in Distributed Information Retrieval
Author :
Wang, Xiuhong ; Ju, Shiguang
Author_Institution :
Inst. of Sci. & Technol. Inf., Jiangsu Univ., Zhenjiang, China
fDate :
March 31 2009-April 2 2009
Abstract :
Resource selection, also called server selection, collection selection or database selection, is a foundational problem in distributed information retrieval (DIR). This paper introduces a set-covering-based algorithm for resource selection in DIR, with consideration of overlapping extent between resources. Give different document with different weight according to its position in merged results for question Q. Only results that have not appeared in some earlier selected resource are focused on in later selected resources. The score of each resource is decided by the total weights of those merged results included in, and only the resource with max score is selected in each selecting step. So, the selecting order is the actual rank of selected resources which are used to search the question Qpsila, which is similar to question Q. The approach saves big searching time due to overlapping between databases and, at the same time, enhances user´s recall rate and precision.
Keywords :
database management systems; information retrieval; collection selection; database selection; distributed information retrieval; overlapping resource selection; server selection; set-covering-based approach; Computer science; Costs; Data engineering; Distributed databases; Feedback; Frequency; Indexing; Information retrieval; Merging; Statistics; Distributed information retrieval; Resource selection; Set-covering-based algorithm;
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
DOI :
10.1109/CSIE.2009.702