DocumentCode
2450034
Title
Towards execution guarantees for stream queries
Author
Fernández-Moctezuma, Rafael J. ; Maier, David ; Tufte, Kristin A.
Author_Institution
Dept. of Comput. Sci., Portland State Univ., Portland, OR, USA
fYear
2010
fDate
19-23 April 2010
Firstpage
1
Lastpage
8
Abstract
The unbounded nature of data streams and the low-latency requirements of stream processing present interesting challenges in Data Stream Management System (DSMS) design. Streaming query operators are typically designed to produce results with low latency, as well as to efficiently manage their state. Stream-progress delimitation techniques, such as punctuation, can help query operators achieve these goals. In this work, we look at deriving execution guarantees with respect to result production and state management for complete queries over punctuated streams. These guarantees are derived before query execution. We formalize notions of successful stream processing at an operator level, and extend these definitions to stream queries as a whole. We introduce a framework, punctuation contracts, for analyzing data processing and punctuation propagation from input to output on individual operators. We then use our framework to analyze complete queries and determine, prior to execution, whether every valid input is eventually emitted and no item remains in operator state indefinitely. Finally, we discuss extensions needed to bound query memory requirements; we describe four stream properties that can be used to help understand and quantify memory and CPU usage.
Keywords
query processing; data stream management system design; data streams; execution guarantees; low-latency requirements; punctuation contracts; punctuation propagation; query execution; query memory requirements; query operators; stream processing; stream queries; stream-progress delimitation; Central Processing Unit; Computer science; Contracts; Data analysis; Data processing; Delay; Pattern analysis; Production; Query processing; Telephony
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
Conference_Location
Atlanta, GA
Print_ISBN
978-1-4244-6533-0
Type
conf
DOI
10.1109/IPDPSW.2010.5470850
Filename
5470850