• DocumentCode
    2450034
  • Title

    Towards execution guarantees for stream queries

  • Author

    Fernández-Moctezuma, Rafael J. ; Maier, David ; Tufte, Kristin A.

  • Author_Institution
    Dept. of Comput. Sci., Portland State Univ., Portland, OR, USA
  • fYear
    2010
  • fDate
    19-23 April 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    The unbounded nature of data streams and the low-latency requirements of stream processing present interesting challenges in Data Stream Management System (DSMS) design. Streaming query operators are typically designed to produce results with low latency, as well as to efficiently manage their state. Stream-progress delimitation techniques, such as punctuation, can help query operators achieve these goals. In this work, we look at deriving execution guarantees with respect to result production and state management for complete queries over punctuated streams. These guarantees are derived before query execution. We formalize notions of successful stream processing at an operator level, and extend these definitions to stream queries as a whole. We introduce a framework, punctuation contracts, for analyzing data processing and punctuation propagation from input to output on individual operators. We then use our framework to analyze complete queries and determine, prior to execution, if every valid input is eventually emitted, and no item remains in operator state indefinitely. Finally, we discuss extensions needed to bound query memory requirements; we describe four stream properties that can be used to help understand and quantify memory and CPU usage.
  • Keywords
    query processing; data stream management system design; data streams; execution guarantees; low-latency requirements; punctuation contracts; punctuation propagation; query execution; query memory requirements; query operators; stream processing; stream queries; stream-progress delimitation; Central Processing Unit; Computer science; Contracts; Data analysis; Data processing; Delay; Pattern analysis; Production; Query processing; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4244-6533-0
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2010.5470850
  • Filename
    5470850