Title :
Efficient processing of semi-stream data
Author :
Naeem, Muhammad A.
Author_Institution :
Sch. of Comput. & Math. Sci., Auckland Univ. of Technol., Auckland, New Zealand
Abstract :
Semi-stream processing is an emerging area of research in data stream management. A common operation in semi-stream processing is joining a stream with disk-based data using a join operator. This operator typically works with limited main memory, which is generally not large enough to hold all of the disk-based data. A number of semi-stream join algorithms that aim for optimal performance have recently been proposed in the literature, but there is still room for improvement. In this invited paper we propose a novel semi-stream join that uses a new cache module. The algorithm is best suited to skewed distributions; in particular, we consider Zipfian distributions of the type that appear in many applications. For such distributions, the new algorithm significantly outperforms existing approaches.
Keywords :
cache storage; data handling; Zipfian distributions; cache module; data stream management; semistream data processing; semistream join; skewed distributions; Algorithm design and analysis; Buffer storage; Database systems; Educational institutions; Mobile communication; Real-time systems; Warehousing;
Conference_Title :
Digital Information Management (ICDIM), 2013 Eighth International Conference on
Conference_Location :
Islamabad
Print_ISBN :
978-1-4799-0613-0
DOI :
10.1109/ICDIM.2013.6694035