Title :
Discovering interesting sequential pattern in large sequence database
Author :
Cui, Wei ; An, Haizhong
Author_Institution :
Sch. of Humanlities & Economic Manage., China Univ. of Geosci., Beijing, China
Abstract :
Sequential pattern mining is an important data mining problem with broad applications. Most previous sequential mining algorithms generate an exponentially large number of sequential patterns. In addition, all items and sequences are treated uniformly. It would be better if the unimportant patterns could be pruned first, resulting in fewer but important patterns after mining. In this paper, we suggest a new algorithm for mining interesting sequential patterns. On the one hand, the resulting patterns are maximal which reduce the number of discovered sequences. On the other hand, weights are used to discover only important sequential patterns. To enhance the miming efficiency, it is proved that the downward closure property of frequent pattern is also retained in the proposed algorithm. Experimental results show that the algorithm is efficient and effective.
Keywords :
data mining; very large databases; data mining; interesting sequential pattern mining; large sequence database; mining efficiency enhancement; Computational intelligence; Computer industry; Data mining; Databases; Environmental economics; Environmental management; Geology; Human resource management; Industrial economics; Sequences; Data mining; maximal frequent sequence; weight;
Conference_Titel :
Computational Intelligence and Industrial Applications, 2009. PACIIA 2009. Asia-Pacific Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4606-3
DOI :
10.1109/PACIIA.2009.5406655