DocumentCode
2625887
Title
Clustering-Variable-Width Histogram Based Window Semi-hash Multi-join over Streams
Author
Zhang, Xiaojian ; Jiang, Wanchang ; Zhang, Yadong ; Huo, Cong
Author_Institution
Henan Univ. of Finance & Econ., Zhengzhou
fYear
2007
fDate
21-23 Nov. 2007
Firstpage
850
Lastpage
853
Abstract
Join operator has become more and more important in the context of data stream. Most join algorithms over streams to date are based on nested loop joins or hash joins. However, these time expensive algorithms can not suit for CPU-limited case. In this paper, a clustering-based variable-width histogram is designed for obtaining value distribution of tuples in the sliding windows, and some important characteristics of tuples can be retained. Semi-hash tables can be constructed by using the histogram. Our sliding window semi-hash multi-join algorithm can minimize the processing time cost of join and produce an accurate join result much earlier. Experimental results show that our approach is more efficient than other approaches.
Keywords
data analysis; file organisation; query processing; clustering-variable-width histogram; data stream; nested loop joins; processing time cost; time expensive algorithms; window semi-hash multi-join; Clustering algorithms; Computer science; Costs; Data analysis; Data engineering; Educational institutions; Finance; Histograms; Information science; Information technology;
fLanguage
English
Publisher
ieee
Conference_Titel
Convergence Information Technology, 2007. International Conference on
Conference_Location
Gyeongju
Print_ISBN
0-7695-3038-9
Type
conf
DOI
10.1109/ICCIT.2007.246
Filename
4420366
Link To Document