DocumentCode
3086121
Title
A Fast Algorithm for Approximate Quantiles in High Speed Data Streams
Author
Zhang, Qi ; Wang, Wei
Author_Institution
Univ. of North Carolina, Chapel Hill
fYear
2007
fDate
9-11 July 2007
Firstpage
29
Lastpage
29
Abstract
We present a fast algorithm for computing approximate quantiles in high speed data streams with deterministic error bounds. For data streams of size N where N is unknown in advance, our algorithm partitions the stream into sub-streams of exponentially increasing size as they arrive. For each sub-stream which has a fixed size, we compute and maintain a multi-level summary structure using a novel algorithm. In order to achieve high speed performance, the algorithm uses simple block-wise merge and sample operations. Overall, our algorithms for fixed-size streams and arbitrary-size streams have a computational cost of O(N log(1/epsivlogepsivN)) and an average per-element update cost of O(log logN) if epsiv is fixed.
Keywords
computational complexity; database management systems; approximate quantiles; arbitrary-size streams; block-wise merge; deterministic error bounds; fast algorithm; high speed data streams; multilevel summary structure; Approximation algorithms; Computational efficiency; Computer errors; Computer science; Costs; Databases; History; Partitioning algorithms; Streaming media; Upper bound;
fLanguage
English
Publisher
ieee
Conference_Titel
Scientific and Statistical Database Management, 2007. SSBDM '07. 19th International Conference on
Conference_Location
Banff, Alta.
ISSN
1551-6393
Print_ISBN
0-7695-2868-6
Electronic_ISBN
1551-6393
Type
conf
DOI
10.1109/SSDBM.2007.27
Filename
4274974
Link To Document