• DocumentCode
    2386321
  • Title

    Multi-aggregate-query Scheduling over Data Streams

  • Author

    Chen, Tao ; Xiao, Nong ; Liu, Fang

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Defense Techonology, Changsha, China
  • fYear
    2010
  • fDate
    8-11 Dec. 2010
  • Firstpage
    27
  • Lastpage
    33
  • Abstract
    With the wide applications of data streams in many fields, such as sensor network monitoring and internet traffic control, query processing over data streams has become increasingly important. In these applications, multiple aggregate queries are registered in the system, and have different sliding window sizes and different frequency upper bounds. How to share the results of these queries is a challenge. Prior work studies how to detect common tasks of these queries and share the results by computing the common tasks only once. Hybrid scheduling first addressed this problem and used the earliest-deadline-first (EDF) method. However, this work did not present a method for computing the scheduling. We formulate the scheduling problem among multiple aggregate queries with different sliding window sizes and different frequency upper bounds over data streams and propose a combination rule to classify these queries. Then, we present an efficient scheduling algorithm to decide whether a query should be executed more often than necessary, as long as the interval between two consecutive executions is less than the frequency upper bound. We also combine our scheduling algorithm with EDF to handle under loaded and overloaded situations. An experimental study shows that our scheduling algorithms are more efficient than no scheduling and EDF in terms of the number of scanned tuples, the throughput and the latency.
  • Keywords
    database management systems; query processing; scheduling; data stream; earliest deadline first method; hybrid scheduling; multiaggregate query scheduling; Aggregates; Schedules; Scheduling; Scheduling algorithm; Semantics; Upper bound; data streams; frequency upper bounds; mutiple aggerate queries; scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2010 International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-9110-0
  • Electronic_ISBN
    978-0-7695-4287-4
  • Type

    conf

  • DOI
    10.1109/PDCAT.2010.92
  • Filename
    5704400