• DocumentCode
    3086121
  • Title

    A Fast Algorithm for Approximate Quantiles in High Speed Data Streams

  • Author

    Zhang, Qi ; Wang, Wei

  • Author_Institution
    Univ. of North Carolina, Chapel Hill
  • fYear
    2007
  • fDate
    9-11 July 2007
  • Firstpage
    29
  • Lastpage
    29
  • Abstract
    We present a fast algorithm for computing approximate quantiles in high speed data streams with deterministic error bounds. For data streams of size N where N is unknown in advance, our algorithm partitions the stream into sub-streams of exponentially increasing size as they arrive. For each sub-stream which has a fixed size, we compute and maintain a multi-level summary structure using a novel algorithm. In order to achieve high speed performance, the algorithm uses simple block-wise merge and sample operations. Overall, our algorithms for fixed-size streams and arbitrary-size streams have a computational cost of O(N log(1/epsivlogepsivN)) and an average per-element update cost of O(log logN) if epsiv is fixed.
  • Keywords
    computational complexity; database management systems; approximate quantiles; arbitrary-size streams; block-wise merge; deterministic error bounds; fast algorithm; high speed data streams; multilevel summary structure; Approximation algorithms; Computational efficiency; Computer errors; Computer science; Costs; Databases; History; Partitioning algorithms; Streaming media; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Management, 2007. SSBDM '07. 19th International Conference on
  • Conference_Location
    Banff, Alta.
  • ISSN
    1551-6393
  • Print_ISBN
    0-7695-2868-6
  • Electronic_ISBN
    1551-6393
  • Type

    conf

  • DOI
    10.1109/SSDBM.2007.27
  • Filename
    4274974