DocumentCode
2109830
Title
An adaptation of Pfam profiles to predict protein sub-cellular localization in Gram positive bacteria
Author
Arango-Argoty, G.A. ; Ruiz-Munoz, J.F. ; Jaramillo-Garzon, Jorge Alberto ; Castellanos-Dominguez, C.G.
Author_Institution
Signal Process. & Recognition Group, Univ. Nac. de Colombia, Manizales, Colombia
fYear
2012
fDate
Aug. 28 2012-Sept. 1 2012
Firstpage
5554
Lastpage
5557
Abstract
Predicting the sub-cellular localization of a protein can provide useful information to uncover its molecular functions. In this sense, numerous prediction techniques have been developed, which usually have been focused on global information of the protein or sequence alignments. However, several studies have shown that the functional nature of proteins is ruled by conserved sub-sequence patterns known as domains. In this paper, an alternative methodology (PfamFeat) for gram-positive bacterial sub-cellular localization was developed. PfamFeat is based on information provided by Pfam database, which stores a series of HMM-profiles describing common protein domains. The likelihood of a sequence, to be generated by a given HMM-profile, can be used to characterize sequences in order to use pattern recognition techniques. Success rates obtained with a simple one-nearest neighbor classifier demonstrate that this method is competitive with popular sub-cellular prediction algorithms and it constitutes a promising research trend.
Keywords
bioinformatics; cellular biophysics; medical signal processing; microorganisms; molecular biophysics; molecular configurations; pattern recognition; proteins; signal classification; Gram positive bacteria; Pfam profiles; PfamFeat method; molecular functions; one-nearest neighbor classifier; pattern recognition techniques; protein domains; protein sub-cellular localization; sequence alignments; sub-cellular prediction algorithms; Databases; Extracellular; Hidden Markov models; Microorganisms; Proteins; Training; Algorithms; Computational Biology; Gram-Positive Bacteria; Subcellular Fractions;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
Conference_Location
San Diego, CA
ISSN
1557-170X
Print_ISBN
978-1-4244-4119-8
Electronic_ISBN
1557-170X
Type
conf
DOI
10.1109/EMBC.2012.6347252
Filename
6347252
Link To Document