DocumentCode :
2948740
Title :
Weighted Bloom filter
Author :
Bruck, J. ; Jie Gao ; Anxiao Jiang
Author_Institution :
Dept. of Electr. Eng., California Inst. of Technol., Pasadena, CA
fYear :
2006
fDate :
9-14 July 2006
Abstract :
A Bloom filter is a simple randomized data structure that answers membership query with no false negative and a small false positive probability. It is an elegant data compression technique for membership information and has broad applications. In this paper, we generalize the traditional Bloom filter to weighted Bloom filter, which incorporates the information on the query frequencies and the membership likelihood of the elements into its optimal design. It has been widely observed that in many applications, some popular elements are queried much more often than the others. The traditional Bloom filter for data sets with irregular query patterns and non-uniform membership likelihood can be further optimized. We derive the optimal configuration of the Bloom filter with query-frequency and membership-likelihood information, and show that the adapted Bloom filter always outperforms the traditional Bloom filter. Under reasonable frequency models such as the step distribution or the Zipf´s distribution, the improvement of the false positive probability of the weighted Bloom filter over that of the traditional Bloom filter has been evaluated by simulations
Keywords :
adaptive filters; data compression; probability; adapted Bloom filter; elegant data compression technique; membership-likelihood information; positive probability; query-frequency; randomized data structure; weighted Bloom filter; Application software; Combinatorial mathematics; Computer science; Data compression; Data structures; Frequency; Information filtering; Information filters; Statistical distributions; Web server; Bloom Filter; Combinatorics; Membership Query;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory, 2006 IEEE International Symposium on
Conference_Location :
Seattle, WA
Print_ISBN :
1-4244-0505-X
Type :
conf
DOI :
10.1109/ISIT.2006.261978
Filename :
4036381
Link To Document :
بازگشت