DocumentCode :
661207
Title :
Efficient processing of semi-stream data
Author :
Naeem, Muhammad A.
Author_Institution :
Sch. of Comput. & Math. Sci., Auckland Univ. of Technol., Auckland, New Zealand
fYear :
2013
fDate :
10-12 Sept. 2013
Firstpage :
7
Lastpage :
10
Abstract :
Semi-stream processing has become an emerging area of research in the field of data stream management. One common operation in semi-stream processing is joining a stream with disk-based data using a join operator. This join operator typically works under limited main memory and this memory is generally not large enough to hold the whole disk-based data. Recently, a number of semi-stream join algorithms have been proposed in the literature to achieve an optimal performance but still there is room to improve the performance. In this invited paper we propose a novel Semi-Stream Join using a new cache module. The algorithm is more appropriate for skewed distributions, and particularly we consider Zipfian distributions of the type that appears in many applications. For such distributions the new algorithm significantly outperforms the existing approaches.
Keywords :
cache storage; data handling; Zipfian distributions; cache module; data stream management; semistream data processing; semistream join; skewed distributions; Algorithm design and analysis; Buffer storage; Database systems; Educational institutions; Mobile communication; Real-time systems; Warehousing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Information Management (ICDIM), 2013 Eighth International Conference on
Conference_Location :
Islamabad
Print_ISBN :
978-1-4799-0613-0
Type :
conf
DOI :
10.1109/ICDIM.2013.6694035
Filename :
6694035
Link To Document :
بازگشت