• DocumentCode
    932298
  • Title

    Bayesian segmental models with multiple sequence alignment profiles for protein secondary structure and contact map prediction

  • Author

    Wei Chu ; Ghahramani, Z. ; Podtelezhnikov, A. ; Wild, D.L.

  • Author_Institution
    Gatsby Comput. Neurosci. Unit, Univ. Coll. London
  • Volume
    3
  • Issue
    2
  • fYear
    2006
  • Firstpage
    98
  • Lastpage
    113
  • Abstract
    In this paper, we develop a segmental semi-Markov model (SSMM) for protein secondary structure prediction which incorporates multiple sequence alignment profiles with the purpose of improving the predictive performance. The segmental model is a generalization of the hidden Markov model where a hidden state generates segments of various length and secondary structure type. A novel parameterized model is proposed for the likelihood function that explicitly represents multiple sequence alignment profiles to capture the segmental conformation. Numerical results on benchmark data sets show that incorporating the profiles results in substantial improvements and the generalization performance is promising. By incorporating the information from long range interactions in beta-sheets, this model is also capable of carrying out inference on contact maps. This is an important advantage of probabilistic generative models over the traditional discriminative approach to protein secondary structure prediction. The Web server of our algorithm and supplementary materials are available at http://public.kgi.edu/-wild/bsm.html
  • Keywords
    Bayes methods; biology computing; hidden Markov models; molecular biophysics; molecular configurations; probability; proteins; Bayesian segmental models; beta-sheets; contact map prediction; hidden Markov model; likelihood function; multiple sequence alignment profiles; probabilistic generative models; protein secondary structure; segmental conformation; segmental semi-Markov model; Accuracy; Bayesian methods; Biological system modeling; Hidden Markov models; Inference algorithms; Neural networks; Predictive models; Protein engineering; Sequences; Web server; Bayesian segmental semi-Markov models; contact maps; generative models; multiple sequence alignment profiles; parametric models.; protein secondary structure; Algorithms; Bayes Theorem; Computational Biology; Databases, Protein; Elapid Venoms; Internet; Likelihood Functions; Markov Chains; Models, Molecular; Models, Statistical; Nerve Tissue Proteins; Protein Structure, Secondary; ROC Curve; Reproducibility of Results; Sequence Alignment;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2006.17
  • Filename
    1631992