DocumentCode :
2282132
Title :
Signature file hashing using term occurrence and query frequencies
Author :
Aktug, Deniz ; Can, Fazli
Author_Institution :
Dept. of Syst. Anal., Miami Univ., Oxford, OH, USA
fYear :
1993
fDate :
23-26 Mar 1993
Firstpage :
148
Lastpage :
153
Abstract :
Signature files act as a filter on retrieval to discard a large number of nonqualifying data items. Linear hashing with superimposed signatures (LHSS) provides an effective retrieval filter to process queries in dynamic databases. This study is an analysis of the effects of reflecting the term occurrence and query frequencies to signatures in LHSS. This approach relaxes the unrealistic uniform frequency assumption and lets the terms with high discriminatory power set more bits in signatures. The simulation experiments based on the derived formulas explore the amount of page savings with different occurrence and query frequency combinations at different hashing levels. The results show that the performance of LHSS improves with the hashing level and the larger the difference between the term discriminatory power values of the terms, the higher the retrieval efficiency. The authors also discuss the benefits of this approach to alleviate the imbalance between the levels of efficiency and relevancy in the unrealistic uniform frequency assumption case
Keywords :
distributed databases; file organisation; query processing; dynamic databases; linear hashing with superimposed signatures; query frequencies; signature file hashing; simulation; term occurrence; Databases; Frequency; Information retrieval; Nonlinear filters;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computers and Communications, 1993., Twelfth Annual International Phoenix Conference on
Conference_Location :
Tempe, AZ
Print_ISBN :
0-7803-0922-7
Type :
conf
DOI :
10.1109/PCCC.1993.344471
Filename :
344471
Link To Document :
بازگشت