• DocumentCode
    1018436
  • Title

    Machine Learning Techniques for the Automated Classification of Adhesin-Like Proteins in the Human Protozoan Parasite Trypanosoma cruzi

  • Author

    González, Ana M. ; Azuaje, Francisco J. ; Ramírez, José L. ; Da Silveira, José F. ; Dorronsoro, José R.

  • Author_Institution
    Comput. Sci. Dept., Univ. Autonoma de Madrid, Madrid, Spain
  • Volume
    6
  • Issue
    4
  • fYear
    2009
  • Firstpage
    695
  • Lastpage
    702
  • Abstract
    This paper reports on the evaluation of different machine learning techniques for the automated classification of coding gene sequences obtained from several organisms in terms of their functional role as adhesins. Diverse, biologically-meaningful, sequence-based features were extracted from the sequences and used as inputs to the in silico prediction models. Another contribution of this work is the generation of potentially novel and testable predictions about the surface protein DGF-1 family in Trypanosoma cruzi. Finally, these techniques are potentially useful for the automated annotation of known adhesin-like proteins from the trans-sialidase surface protein family in T. cruzi, the etiological agent of Chagas disease.
  • Keywords
    diseases; feature extraction; genomics; learning (artificial intelligence); medical diagnostic computing; pattern classification; proteins; Chagas disease; Trypanosoma cruzi; adhesin-like proteins; coding gene sequences; feature extraction; human protozoan parasite; in silico prediction model; machine learning techniques; trans-sialidase surface protein family; Chagas disease; Machine learning; Pattern Recognition; adhesin-like proteins; genomic data mining; machine learning.; Animals; Artificial Intelligence; Chagas Disease; Computational Biology; Databases, Protein; Glycoproteins; Humans; Membrane Proteins; Models, Statistical; Multigene Family; Neuraminidase; Proteomics; Protozoan Proteins; Trypanosoma cruzi;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2008.125
  • Filename
    4695820