DocumentCode :
1691833
Title :
A stratified sampling algorithm for landmark windows over data streams
Author :
Zhao, Guangyuan ; Zhang, Longbo ; Wang, Fengying ; Li, Caihong ; Wang, Yong
Author_Institution :
Sch. of Comput. Sci., Shandong Univ. of Technol., Zibo, China
fYear :
2010
Firstpage :
2817
Lastpage :
2822
Abstract :
In many applications, data does not take the form of traditional stored relations, but rather arrives in continuous, rapid, time-varying data streams,and data streams are potentially unbounded in size. Focusing on the problem of sampling from landmark windows over data streams, a new concept, which is called stratified sampling ratio function, is presented. Then a multistage stratified sampling algorithm for landmark window model is introduced. In the algorithm, a dynamic candidate sample set is maintained. When an arrived tuple is determined to enter the sample set and to be deleted from the sample, the arrival time of data items is considered, and the probability for selecting to enter and remain in the sample set of more recent arrived tuples is greater than that of older ones. The theoretic analysis and experiments show that the algorithm is effective and efficient for continuous data streams processing.
Keywords :
data handling; landmark windows; stratified sampling ratio function; time-varying data streams; Algorithm design and analysis; Computer science; Educational institutions; Focusing; Heuristic algorithms; Intelligent control; Medical services; data stream; landmark window; stratified sampling algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation (WCICA), 2010 8th World Congress on
Conference_Location :
Jinan
Print_ISBN :
978-1-4244-6712-9
Type :
conf
DOI :
10.1109/WCICA.2010.5554610
Filename :
5554610
Link To Document :
بازگشت