DocumentCode
668133
Title
Parallelizing windowed stream joins in a shared-nothing cluster
Author
Chakraborty, Arpan ; Singh, Ashutosh
Author_Institution
Sch. of Inf. & Comput. Sci., Indiana Univ., Bloomington, IN, USA
fYear
2013
fDate
23-27 Sept. 2013
Firstpage
1
Lastpage
5
Abstract
The availability of a large number of processing nodes in a parallel and distributed computing environment enables sophisticated real time processing over high speed data streams, as required by many emerging applications. Sliding window stream joins are among the most important operators in a stream processing system. In this paper, we consider the issue of parallelizing a sliding window stream join operator over a shared nothing cluster. We propose a Bulk Synchronous Processing (BSP) framework, based on a fixed or predefined sequence of communication, to distribute the join processing loads over a shared-nothing cluster. We consider various processing and communication overheads while scaling over a large number of nodes, and propose solution methodologies to cope with the issues.We implement the algorithm over a cluster using a message passing system, and present the experimental results showing the effectiveness of the join processing algorithm.
Keywords
application program interfaces; message passing; parallel processing; BSP framework; bulk synchronous processing framework; communication overhead; distributed computing environment; join processing algorithm; message passing system; parallel computing environment; parallelization; processing nodes; processing overhead; shared nothing cluster; shared-nothing cluster; stream processing system; windowed stream joins; Databases; Delays; Heuristic algorithms; Parallel processing; Partitioning algorithms; Scalability; Tuning;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location
Indianapolis, IN
Type
conf
DOI
10.1109/CLUSTER.2013.6702636
Filename
6702636
Link To Document