DocumentCode :
2381539
Title :
Random Sampling over Streaming Window Joins
Author :
Ren, Jiadong ; Jiang, Wanchang ; Huo, Cong
fYear :
2007
fDate :
1-3 Nov. 2007
Firstpage :
53
Lastpage :
55
Abstract :
Two novel sampling approaches are proposed to obtain a random sample of exact streaming window join result. Without assuming any model of stream arrivals, the frequency of join attribute values for various basic periods can be obtained by a frequency balanced binary tree histogram (FATH) which is constructed for each stream. The frequency for the future window can be computed by linear regression with the help of the information in the FATH. With the random sample of exact join result produced, a windowed aggregate over the exact join results can be unbiasedly and accurately estimated. Experimental results show that our approach is more efficient than other approach for arbitrary streams.
Keywords :
Aggregates; Binary trees; Clustering algorithms; Data privacy; Educational institutions; Frequency estimation; Histograms; Information science; Linear regression; Sampling methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data, Privacy, and E-Commerce, 2007. ISDPE 2007. The First International Symposium on
Conference_Location :
Chengdu
Print_ISBN :
978-0-7695-3016-1
Type :
conf
DOI :
10.1109/ISDPE.2007.51
Filename :
4402638
Link To Document :
بازگشت