DocumentCode
3023755
Title
Why Not Semijoins for Streams, When Distributed?
Author
Tri Tran ; Byung Suk Lee ; Bovee, M.W.
Author_Institution
Univ. of Vermont, Burlington
fYear
2007
fDate
1-5 July 2007
Firstpage
27
Lastpage
27
Abstract
This paper addresses the semijoin-based window join algorithm over distributed data streams. In distributed stream query processing, data streams arriving at remote sites need to be shipped to the processing site for query execution. This typically introduces high communication overhead over the network. Our observation is that the semijoin, which is effective at reducing communication overhead in distributed database query processing, can also be effective in distributed stream query processing. The challenge, of course, lies in the streaming nature of tuples, the processing of which is fundamentally different from processing a set of tuples. We address this challenge by first adapting the window-based stream join to a distributed environment. The resulting join algorithm (called simple join) uses the idea of exporting a window to the query processing site. We then adopt the semijoin to reduce the communication overhead (in return for a marginal increase in the processing overhead). The resulting semijoin-based join algorithm uses the ideas of a mirror window and a partial tuple. That is, it creates a copy of a remote window at the processing site and sends a partial tuple to probe for matching tuples before sending a full tuple. Finally, we analyze the two join algorithms using our proposed cost models and verify the analysis results through a set of experiments.
Keywords
distributed databases; query processing; communication overhead; distributed data streams; distributed database query processing; distributed stream query processing; query execution; remote window; semijoin-based window join algorithm; Algorithm design and analysis; Computer science; Costs; Distributed databases; Mirrors; Monitoring; Probes; Query processing; Strontium; Telecommunication traffic
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Telecommunications, 2007. ICDT '07. Second International Conference on
Conference_Location
San Jose, CA
Print_ISBN
0-7695-2910-0
Electronic_ISBN
0-7695-2910-0
Type
conf
DOI
10.1109/ICDT.2007.38
Filename
4270593