DocumentCode :
2181212
Title :
Probabilistic counting
Author :
Flajolet, Philippe ; Martin, G. Nigel
fYear :
1983
fDate :
7-9 Nov. 1983
Firstpage :
76
Lastpage :
82
Abstract :
We present here a class of probabilistic algorithms with which one can estimate the number of distinct elements in a collection of data (typically a large file stored on disk) in a single pass, using only O(1) auxiliary storage and O(1) operations per element. We precisely quantify the accuracy-storage trade-offs: for instance a typical accuracy of about 5% can be achieved using only 256 binary words, even for very large files. The algorithms are totally insensitive to the replicative structure of the elements in the file. They are particularly adapted to data base systems in the context of query optimization and can be implemented in a decentralized manner (thus making them also useful for distributed data base applications).
Keywords :
Algorithm design and analysis; Counting circuits; Degradation; Distributed processing; Laboratories; Performance gain; Sampling methods; Sorting; Winches;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Foundations of Computer Science, 1983., 24th Annual Symposium on
Conference_Location :
Tucson, AZ, USA
ISSN :
0272-5428
Print_ISBN :
0-8186-0508-1
Type :
conf
DOI :
10.1109/SFCS.1983.46
Filename :
4568063
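Illustrative note on the abstract: the abstract states the properties of the algorithms (single pass, constant storage, insensitivity to duplicates) but does not spell out a procedure. The sketch below is one possible reading, based on the bitmap scheme with stochastic averaging commonly associated with this work. The names `pcsa_estimate`, `_hash64` and `_rho`, the choice of BLAKE2b as the hash, the 32-bit bitmap width, and the correction constant 0.77351 are assumptions made for illustration and do not appear in this record. Under the commonly quoted standard error of roughly 0.78/sqrt(m), m = 256 bitmaps would give about 4.9%, which is consistent with the "about 5% ... using only 256 binary words" figure in the abstract.

```python
import hashlib

# Correction constant commonly quoted for this estimator (assumption:
# it is not stated in the record above).
PHI = 0.77351


def _hash64(item):
    """Deterministic 64-bit hash of an item (BLAKE2b is an arbitrary choice)."""
    digest = hashlib.blake2b(str(item).encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def _rho(x, width):
    """Position of the lowest set bit of x, or `width` if x is zero."""
    if x == 0:
        return width
    return (x & -x).bit_length() - 1


def pcsa_estimate(items, m=256, width=32):
    """Estimate the number of distinct items using m small bitmaps.

    Each item is hashed once; the low-order part of the hash selects a
    bitmap, and the rest selects which bit of that bitmap to set. Storage
    is m words regardless of how many items are seen.
    """
    bitmaps = [0] * m
    for item in items:
        h = _hash64(item)
        bucket = h % m                          # which bitmap to update
        rest = (h // m) & ((1 << width) - 1)    # remaining hash bits
        bitmaps[bucket] |= 1 << _rho(rest, width)

    # For each bitmap, find the position of its lowest zero bit.
    total = 0
    for b in bitmaps:
        r = 0
        while b & (1 << r):
            r += 1
        total += r

    # Average the positions, then scale by m and the correction constant.
    return (m / PHI) * 2 ** (total / m)
```

Because repeated items set the same bit again, the result depends only on the set of distinct values, which matches the abstract's claim of insensitivity to replication. Since the state is a list of bitmaps, sketches built on separate machines could in principle be merged with a bitwise OR, which is one way to read the "decentralized" remark. The estimate is known to be biased upward when the number of distinct items is small relative to m.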