DocumentCode :
2455740
Title :
An efficient algorithm for protein sequence pattern mining
Author :
Zhou, Qingda ; Jiang, Qingshan ; Li, Sheng ; Xie, Xiaobiao ; Lin, Lida
Author_Institution :
Software Sch., Xiamen Univ., Xiamen, China
fYear :
2010
fDate :
24-27 Aug. 2010
Firstpage :
1876
Lastpage :
1881
Abstract :
Protein Sequence is a very important part of biological sequence data, to which the analysis and study have become an important research direction and content in bioinformatics domain. Through the pattern mining to the sequence, some study can be performed on a protein sequence or a protein family sequence, making the protein sequence pattern mining of protein sequences a much important task in this field. MBioPM is one of the latest biological sequence pattern mining algorithm by introducing the concept of pattern classification to improve its efficiency, but the efficiency of the algorithm is still unsatisfied, and there are redundant issues in mining results. Therefore, this paper proposes a pattern mining algorithm mMBioPM to improve the efficiency by optimizing Hash list structures with pattern partition characteristics and reducing the running time. Experiments show that our optimized mMBioPM algorithm can effectively improve the efficiency and solve the redundancy problem in the results.
Keywords :
bioinformatics; data mining; molecular biophysics; pattern classification; proteins; bioinformatics domain; biological sequence data; biological sequence pattern mining algorithm; hash list structures; mMBioPM algorithm; pattern classification; pattern partition characteristics; protein family sequence; protein sequence pattern mining; redundancy problem; Algorithm design and analysis; Arrays; Data mining; Pattern matching; Protein sequence; Redundancy; bioinformatics; data mining; pattern mining; protein sequence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Education (ICCSE), 2010 5th International Conference on
Conference_Location :
Hefei
Print_ISBN :
978-1-4244-6002-1
Type :
conf
DOI :
10.1109/ICCSE.2010.5593815
Filename :
5593815
Link To Document :
بازگشت