Title :
Efficiently Detecting Frequent Patterns in Biological Sequences
Author :
Liu, Wei ; Chen, Ling
Author_Institution :
Inst. of Inf. Sci. & Technol., Yangzhou Univ., Yangzhou, China
Abstract :
Most of the existing algorithms for mining frequent patterns could produce lots of projected databases and short candidate patterns which could increase the time and memory cost of mining. In order to overcome such shortcoming, we propose two fast and efficient algorithms named SBPM and MSPM for mining frequent patterns in single and multiple biological respectively. We first present the concept of primary pattern, and then use prefix tree for mining frequent primary patterns. A pattern growth approach is also presented to mine all the frequent patterns without producing large amount of irrelevant patterns. Our experimental results show that our algorithms not only improve the performance but also achieve effective mining results.
Keywords :
biology computing; data mining; database management systems; pattern recognition; biological sequences; databases; frequent patterns detection; frequent patterns mining; Algorithm design and analysis; Complexity theory; Data mining; Proteins; Silicon; Vectors; biological sequence; frequent pattern mining; prefix tree; primary pattern;
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2011 Eighth
Conference_Location :
Chongqing
Print_ISBN :
978-1-4577-1812-0
DOI :
10.1109/WISA.2011.27