DocumentCode :
2824537
Title :
A Bit-Encoding Approach for Frequent Sequence Mining
Author :
Zhu, Zhaoliang ; Mu, Xuefeng ; Zhou, Gang
Author_Institution :
State Key Lab. of Software Dev. Environ., Beijing Univ. of Aeronaut. & Astronaut., Beijing, China
fYear :
2009
fDate :
11-13 Dec. 2009
Firstpage :
1
Lastpage :
7
Abstract :
Sequential pattern mining is an important data mining problem with broad applications. The classic algorithm for this problem is PrefixSpan, a projection-based, sequential pattern-growth approach. However, PrefixSpan must scan each projected database once to find its local frequent items, which can be costly when the database is large. In this paper, we propose PSBE (PrefixSpan with Bit-Encoding), an encoding-based, sequential pattern-growth approach for efficient mining of sequential patterns that finds local frequent items at little cost. In this approach, we encode the database with 0/1 bits, build an encoded database, and use the PrefixSpan framework to find sequential patterns; the properties of the encoding are what allow us to improve on PrefixSpan. Our experiments show that PSBE outperforms the PrefixSpan algorithm.
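The abstract does not spell out the PSBE encoding, so the following is only a minimal sketch of the general idea, under stated assumptions: each sequence is a list of single items, and each item is encoded per sequence as an integer bitmask of the positions where it occurs. With that encoding, checking whether an item still occurs after a prefix becomes a shift and a test rather than a scan of the projected suffix. The names `encode` and `mine` are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def encode(db):
    """Map each item to {sequence index: bitmask of positions where it occurs}."""
    enc = defaultdict(dict)
    for sid, seq in enumerate(db):
        for pos, item in enumerate(seq):
            enc[item][sid] = enc[item].get(sid, 0) | (1 << pos)
    return enc


def mine(db, min_support):
    """Return {pattern tuple: support} using PrefixSpan-style pattern growth.

    Assumption: sequences contain single items only (no itemsets).
    """
    enc = encode(db)
    patterns = {}

    def grow(prefix, starts):
        # starts: {sid: first position still available after the prefix}
        for item, masks in enc.items():
            next_starts = {}
            for sid, start in starts.items():
                rest = masks.get(sid, 0) >> start  # drop bits before `start`
                if rest:
                    # lowest set bit = earliest occurrence at or after `start`
                    first = (rest & -rest).bit_length() - 1
                    next_starts[sid] = start + first + 1
            if len(next_starts) >= min_support:
                pattern = prefix + (item,)
                patterns[pattern] = len(next_starts)
                grow(pattern, next_starts)

    grow((), {sid: 0 for sid in range(len(db))})
    return patterns
```

For example, `mine([["a", "b", "c"], ["a", "c", "b"], ["a", "b"]], 2)` would report `("a", "b")` with support 3 and `("a", "c")` with support 2, while `("b", "c")` is dropped because it appears in only one sequence. The dictionary `starts` plays the role of the projected database here: it stores one offset per sequence instead of a copy of the suffix.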
Keywords :
data mining; deductive databases; encoding; PrefixSpan algorithm; bit encoding; data mining problem; sequential database; sequential pattern mining; Application software; Costs; Data mining; Databases; Diseases; Encoding; Frequency; Pattern analysis; Programming; Sequences;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4507-3
Electronic_ISBN :
978-1-4244-4507-3
Type :
conf
DOI :
10.1109/CISE.2009.5363782
Filename :
5363782