DocumentCode :
2130918
Title :
An Efficient Sequential Pattern Mining Algorithm Based on the 2-Sequence Matrix
Author :
Hsieh, Chia-Ying ; Yang, Don-Lin ; Wu, Jungpin
Author_Institution :
Dept. of Inf., Eng. & Comput. Sci., Feng Chia Univ., Taichung
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
583
Lastpage :
591
Abstract :
Sequential pattern mining has become more and more popular in recent years due to its wide applications and the fact that it can find more information than association rules. Two famous algorithms in sequential pattern mining are AprioriAll and PrefixSpan. These two algorithms not only need to scan a database or projected-databases many times, but also require setting a minimal support threshold to prune infrequent data to obtain useful sequential patterns efficiently. In addition, they must rescan the database if new items or sequences are added. In this paper, we propose a novel algorithm called efficient sequential pattern enumeration (ESPE) to solve the above problems. In addition, our method can be applied in many applications, such as for the itemsets appearing at the same time in a sequence. In our experiments, we show that the performance of ESPE is better than the other two methods using various datasets.
Keywords :
data mining; matrix algebra; 2-sequence matrix; AprioriAll; PrefixSpan; association rules; efficient sequential pattern enumeration; sequential pattern mining algorithm; Application software; Association rules; Bioinformatics; Computer science; Conferences; Data engineering; Data mining; Databases; Itemsets; Statistics; Sequential pattern; association rule; candidate enumeration; data mining; minimum support;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2008. ICDMW '08. IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3503-6
Electronic_ISBN :
978-0-7695-3503-6
Type :
conf
DOI :
10.1109/ICDMW.2008.82
Filename :
4733982
Link To Document :
بازگشت