Title :
Efficient Mining of Closed Sequential Patterns on Stream Sliding Window
Author :
Gao, Chuancong ; Wang, Jianyong ; Yang, Qingyan
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Abstract :
As a typical data mining research topic, sequential pattern mining has been studied extensively for the past decade. Recently, mining various sequential patterns incrementally over stream data has raised great interest. Due to the challenges of mining stream data, many difficulties not so obvious in static data mining have to be reconsidered carefully. In this paper, we propose a novel algorithm which stores only frequent closed prefixes in its enumeration tree structure, used for mining and maintaining patterns in the current sliding window, to solve the frequent closed sequential pattern mining problem efficiently over stream data. Some effective search space pruning and pattern closure checking strategies have been also devised to accelerate the algorithm. Experimental results show that our algorithm outperforms other state-of-the-art algorithm significantly in both running time and memory use.
Keywords :
data mining; closed sequential patterns; data mining; pattern closure checking; pattern mining; search space pruning; stream data; stream sliding window; Acceleration; Algorithm design and analysis; Data mining; Itemsets; Runtime; Unsolicited electronic mail; Closed Sequential Pattern; Sliding Window;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.61