DocumentCode :
2533074
Title :
Efficiently Detecting Frequent Patterns in Biological Sequences
Author :
Liu, Wei ; Chen, Ling
Author_Institution :
Inst. of Inf. Sci. & Technol., Yangzhou Univ., Yangzhou, China
fYear :
2011
fDate :
21-23 Oct. 2011
Firstpage :
102
Lastpage :
107
Abstract :
Most of the existing algorithms for mining frequent patterns could produce lots of projected databases and short candidate patterns which could increase the time and memory cost of mining. In order to overcome such shortcoming, we propose two fast and efficient algorithms named SBPM and MSPM for mining frequent patterns in single and multiple biological respectively. We first present the concept of primary pattern, and then use prefix tree for mining frequent primary patterns. A pattern growth approach is also presented to mine all the frequent patterns without producing large amount of irrelevant patterns. Our experimental results show that our algorithms not only improve the performance but also achieve effective mining results.
Keywords :
biology computing; data mining; database management systems; pattern recognition; biological sequences; databases; frequent patterns detection; frequent patterns mining; Algorithm design and analysis; Complexity theory; Data mining; Proteins; Silicon; Vectors; biological sequence; frequent pattern mining; prefix tree; primary pattern;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2011 Eighth
Conference_Location :
Chongqing
Print_ISBN :
978-1-4577-1812-0
Type :
conf
DOI :
10.1109/WISA.2011.27
Filename :
6093574
Link To Document :
بازگشت