DocumentCode :
3086121
Title :
A Fast Algorithm for Approximate Quantiles in High Speed Data Streams
Author :
Zhang, Qi ; Wang, Wei
Author_Institution :
Univ. of North Carolina, Chapel Hill
fYear :
2007
fDate :
9-11 July 2007
Firstpage :
29
Lastpage :
29
Abstract :
We present a fast algorithm for computing approximate quantiles in high speed data streams with deterministic error bounds. For data streams of size N where N is unknown in advance, our algorithm partitions the stream into sub-streams of exponentially increasing size as they arrive. For each sub-stream which has a fixed size, we compute and maintain a multi-level summary structure using a novel algorithm. In order to achieve high speed performance, the algorithm uses simple block-wise merge and sample operations. Overall, our algorithms for fixed-size streams and arbitrary-size streams have a computational cost of O(N log(1/epsivlogepsivN)) and an average per-element update cost of O(log logN) if epsiv is fixed.
Keywords :
computational complexity; database management systems; approximate quantiles; arbitrary-size streams; block-wise merge; deterministic error bounds; fast algorithm; high speed data streams; multilevel summary structure; Approximation algorithms; Computational efficiency; Computer errors; Computer science; Costs; Databases; History; Partitioning algorithms; Streaming media; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 2007. SSBDM '07. 19th International Conference on
Conference_Location :
Banff, Alta.
ISSN :
1551-6393
Print_ISBN :
0-7695-2868-6
Electronic_ISBN :
1551-6393
Type :
conf
DOI :
10.1109/SSDBM.2007.27
Filename :
4274974
Link To Document :
بازگشت