• DocumentCode
    840548
  • Title

    Incremental Evaluation of Sliding-Window Queries over Data Streams

  • Author

    Ghanem, Thanaa M. ; Hammad, Moustafa A. ; Mokbel, Mohamed F. ; Aref, Walid G. ; Elmagarmid, Ahmed K.

  • Author_Institution
    Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN
  • Volume
    19
  • Issue
    1
  • fYear
    2007
  • Firstpage
    57
  • Lastpage
    72
  • Abstract
    Two research efforts have been conducted to realize sliding-window queries in data stream management systems, namely, query revaluation and incremental evaluation. In the query reevaluation method, two consecutive windows are processed independently of each other. On the other hand, in the incremental evaluation method, the query answer for a window is obtained incrementally from the answer of the preceding window. In this paper, we focus on the incremental evaluation method. Two approaches have been adopted for the incremental evaluation of sliding-window queries, namely, the input-triggered approach and the negative tuples approach. In the input-triggered approach, only the newly inserted tuples flow in the query pipeline and tuple expiration is based on the timestamps of the newly inserted tuples. On the other hand, in the negative tuples approach, tuple expiration is separated from tuple insertion where a tuple flows in the pipeline for every inserted or expired tuple. The negative tuples approach avoids the unpredictable output delays that result from the input-triggered approach. However, negative tuples double the number of tuples through the query pipeline, thus reducing the pipeline bandwidth. Based on a detailed study of the incremental evaluation pipeline, we classify the incremental query operators into two classes according to whether an operator can avoid the processing of negative tuples or not. Based on this classification, we present several optimization techniques over the negative tuples approach that aim to reduce the overhead of processing negative tuples while avoiding the output delay of the query answer. A detailed experimental study, based on a prototype system implementation, shows the performance gains over the input-triggered approach of the negative tuples approach when accompanied with the proposed optimizations
  • Keywords
    database management systems; query processing; data stream management system; incremental evaluation; input-triggered approach; negative tuples approach; optimization technique; pipeline bandwidth; query pipeline; query revaluation; sliding-window query; timestamps; tuple expiration; Bandwidth; Delay; Disaster management; Helium; Performance gain; Pipelines; Prototypes; Query processing; Data stream management systems; negative tuples.; pipelined query execution;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.250585
  • Filename
    4016515