DocumentCode :
3324106
Title :
FLAME: Shedding Light on Hidden Frequent Patterns in Sequence Datasets
Author :
Tata, Sandeep ; Patel, Jignesh M.
Author_Institution :
Almaden Res. Center, IBM, San Jose, CA
fYear :
2008
fDate :
7-12 April 2008
Firstpage :
1343
Lastpage :
1345
Abstract :
Existing database sequence mining algorithms focus on mining for subsequences. However, for many emerging applications, the subsequence model is inadequate for detecting interesting patterns. Often, an approximate substring model better accommodates the notion of a noisy pattern. In this paper, we present a powerful new model for approximate pattern mining. We show that this model can be used to capture the notion of an approximate match for a variety of different applications. We also present a novel, suffix tree based pattern mining algorithm called FLAME and demonstrate that it is a fast, accurate, and scalable method for discovering hidden patterns in large sequence databases.
Keywords :
data mining; pattern recognition; very large databases; FLAME; database sequence mining; hidden frequent patterns; large sequence databases; noisy pattern; pattern detection; pattern mining; sequence datasets; subsequence model; Computational biology; Data mining; Databases; Fires; Fluctuations; Heart; Inspection; Pattern analysis; Pattern matching; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
Type :
conf
DOI :
10.1109/ICDE.2008.4497550
Filename :
4497550
Link To Document :
بازگشت