Title :
A Fast Mining and Updating Algorithm for Frequent Patterns on Biological Single Sequence
Author :
Wei, Liu ; Ling, Chen
Author_Institution :
Inst. of Inf. Sci. & Technol., Nanjing Univ. of Aeronaut. & Astronaut., Nanjing, China
Abstract :
Traditional frequent pattern mining algorithms construct many projected databases and generate many short patterns during mining, which makes mining inefficient. To overcome these shortcomings, a fast and efficient algorithm, SSPM, is proposed. We use longer patterns for mining, which avoids producing many short patterns. We also use a prefix tree of primary patterns for frequent pattern mining and pattern growth, which avoids producing many irrelevant patterns. The experimental results show that SSPM not only improves performance but also achieves effective mining results.
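The abstract does not give the details of SSPM, so the following is only an illustrative sketch of the general idea it names: start from longer seed patterns on a single sequence, store them in a prefix tree, and grow them. It is not the authors' algorithm. Everything in it is an assumption made for illustration: the function names, the parameters `k` and `min_support`, the choice to treat fixed-length substrings as "primary patterns", and the choice to count overlapping occurrences.

```python
from collections import Counter


def frequent_kmers(sequence, k, min_support):
    """Count every length-k substring (overlaps included) and keep
    those that occur at least min_support times."""
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    return {p: c for p, c in counts.items() if c >= min_support}


def build_prefix_tree(patterns):
    """Store patterns in a nested-dict prefix tree.
    The hypothetical marker key '$' holds a pattern's support count."""
    root = {}
    for pattern, count in patterns.items():
        node = root
        for symbol in pattern:
            node = node.setdefault(symbol, {})
        node["$"] = count
    return root


def grow_patterns(sequence, k, min_support):
    """Start from frequent length-k patterns and try longer lengths.
    Growth stops at the first length with no frequent pattern, which is
    safe because every substring of a frequent pattern is also frequent."""
    result = {}
    length = k
    current = frequent_kmers(sequence, length, min_support)
    while current:
        result.update(current)
        length += 1
        current = frequent_kmers(sequence, length, min_support)
    return result


if __name__ == "__main__":
    seq = "ATGATGATGC"  # toy sequence, not data from the paper
    primary = frequent_kmers(seq, k=3, min_support=2)
    tree = build_prefix_tree(primary)
    print(primary)
    print(tree["A"]["T"]["G"]["$"])
    print(grow_patterns(seq, k=3, min_support=2))
```

Starting at length `k` rather than length 1 mirrors the abstract's point about avoiding many short patterns; how SSPM actually selects and extends its primary patterns may differ.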
Keywords :
bioinformatics; data mining; trees (mathematics); SSPM algorithm; biological single sequence; data mining; frequent patterns algorithms; tree; Algorithm design and analysis; Bioinformatics; Classification algorithms; Data mining; Databases; Proteins; Biological Single Sequence; Prefix Tree of Primary Patterns; Sequential Frequent Pattern Mining;
Conference_Title :
Information Technology and Applications (IFITA), 2010 International Forum on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-7621-3
Electronic_ISBN :
978-1-4244-7622-0
DOI :
10.1109/IFITA.2010.142