• DocumentCode
    2109830
  • Title

    An adaptation of Pfam profiles to predict protein sub-cellular localization in Gram positive bacteria

  • Author

    Arango-Argoty, G.A. ; Ruiz-Munoz, J.F. ; Jaramillo-Garzon, Jorge Alberto ; Castellanos-Dominguez, C.G.

  • Author_Institution
    Signal Process. & Recognition Group, Univ. Nac. de Colombia, Manizales, Colombia
  • fYear
    2012
  • fDate
    Aug. 28 2012-Sept. 1 2012
  • Firstpage
    5554
  • Lastpage
    5557
  • Abstract
    Predicting the sub-cellular localization of a protein can provide useful information to uncover its molecular functions. In this sense, numerous prediction techniques have been developed, which usually have been focused on global information of the protein or sequence alignments. However, several studies have shown that the functional nature of proteins is ruled by conserved sub-sequence patterns known as domains. In this paper, an alternative methodology (PfamFeat) for gram-positive bacterial sub-cellular localization was developed. PfamFeat is based on information provided by Pfam database, which stores a series of HMM-profiles describing common protein domains. The likelihood of a sequence, to be generated by a given HMM-profile, can be used to characterize sequences in order to use pattern recognition techniques. Success rates obtained with a simple one-nearest neighbor classifier demonstrate that this method is competitive with popular sub-cellular prediction algorithms and it constitutes a promising research trend.
  • Keywords
    bioinformatics; cellular biophysics; medical signal processing; microorganisms; molecular biophysics; molecular configurations; pattern recognition; proteins; signal classification; Gram positive bacteria; Pfam profiles; PfamFeat method; molecular functions; one-nearest neighbor classifier; pattern recognition techniques; protein domains; protein sub-cellular localization; sequence alignments; sub-cellular prediction algorithms; Databases; Extracellular; Hidden Markov models; Microorganisms; Proteins; Training; Algorithms; Computational Biology; Gram-Positive Bacteria; Subcellular Fractions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
  • Conference_Location
    San Diego, CA
  • ISSN
    1557-170X
  • Print_ISBN
    978-1-4244-4119-8
  • Electronic_ISBN
    1557-170X
  • Type

    conf

  • DOI
    10.1109/EMBC.2012.6347252
  • Filename
    6347252