Title :
Mining Top-K Sequential Patterns in the Data Stream Environment
Author :
Dai, Bi-Ru ; Jiang, Hung-Lin ; Chung, Chih-Heng
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
Abstract :
Sequential pattern mining is a process of extracting useful patterns in data sequences. Existing works on mining Top-K patterns on data streams are mostly for non-sequential patterns. In our framework, we focus on the topic of Top-K sequential pattern mining, where users can obtain adequate amount of interesting patterns. The proposed method can automatically adjust the minimum support during mining each batch in the data stream to obtain candidate patterns. Then candidate patterns are maintained by a tree structure for extracting Top-K sequential patterns. Empirical results show that the proposed method is efficient and scalable.
Keywords :
data mining; candidate patterns; data stream environment; top-k sequential pattern extraction; top-k sequential pattern mining; tree structure; Data mining; Top-K sequential patterns; data stream;
Conference_Titel :
Technologies and Applications of Artificial Intelligence (TAAI), 2010 International Conference on
Conference_Location :
Hsinchu City
Print_ISBN :
978-1-4244-8668-7
Electronic_ISBN :
978-0-7695-4253-9
DOI :
10.1109/TAAI.2010.33