DocumentCode :
841412
Title :
Compressed Bloom filters
Author :
Mitzenmacher, Michael
Author_Institution :
Harvard Univ., Cambridge, MA, USA
Volume :
10
Issue :
5
fYear :
2002
fDate :
10/1/2002 12:00:00 AM
Firstpage :
604
Lastpage :
612
Abstract :
A Bloom filter is a simple space-efficient randomized data structure for representing a set in order to support membership queries. Although Bloom filters allow false positives, for many applications the space savings outweigh this drawback when the probability of an error is sufficiently low. We introduce compressed Bloom filters, which improve performance when the Bloom filter is passed as a message, and its transmission size is a limiting factor. For example, Bloom filters have been suggested as a means for sharing Web cache information. In this setting, proxies do not share the exact contents of their caches, but instead periodically broadcast Bloom filters representing their caches. By using compressed Bloom filters, proxies can reduce the number of bits broadcast, the false positive probability, and/or the amount of computation per lookup. The cost is the processing time for compression and decompression, which can use simple arithmetic coding, and more memory use at the proxies, which utilize the larger uncompressed form of the Bloom filter.
Keywords :
Internet; arithmetic codes; cache storage; data compression; data structures; error statistics; performance evaluation; protocols; Web cache information; arithmetic coding; compressed Bloom filters; computation per lookup; computer networks; decompression; distributed computing; distributed information systems; error probability; false positive probability; information theory; membership queries; memory use; performance; processing time; protocols; proxies; randomized data structure; space-efficient data structure; Arithmetic; Broadcasting; Computer networks; Costs; Data structures; Digital filters; Information filtering; Information filters; Information theory; Protocols;
fLanguage :
English
Journal_Title :
Networking, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1063-6692
Type :
jour
DOI :
10.1109/TNET.2002.803864
Filename :
1041067
Link To Document :
بازگشت