DocumentCode :
1611984
Title :
Protein modeling using hidden Markov models: analysis of globins
Author :
Haussler, David ; Krogh, Anders ; Mian, I. Saira ; Sjölander, Kimmen
Author_Institution :
California Univ., Santa Cruz, CA, USA
fYear :
1993
Firstpage :
792
Abstract :
The authors apply hidden Markov models to the problem of statistical modeling and multiple sequence alignment of protein families. A variant of the expectation maximization algorithm known as the Viterbi algorithm is used to obtain the statistical model from the unaligned sequences. In a detailed series of experiments, they have taken 400 unaligned globin sequences, and produced a statistical model entirely automatically from the primary sequences. The authors used no prior knowledge of globin structure. Using this model, a multiple alignment of the 400 sequences and 225 other globin sequences was obtained that agrees almost perfectly with a structural alignment by D. Bashford et al. (1987). This model can also discriminate all these 625 globins from nonglobin protein sequences with greater than 99% accuracy, and can thus be used for database searches.
Keywords :
hidden Markov models; physiological models; proteins; statistical analysis; Viterbi algorithm; database searches; expectation maximization algorithm; globins; hidden Markov models; multiple alignment; multiple sequence alignment; protein modelling; statistical modeling; Amino acids; Databases; Hidden Markov models; Laboratories; Predictive models; Probability; Proteins; Sequences; Transportation; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 1993, Proceeding of the Twenty-Sixth Hawaii International Conference on
Print_ISBN :
0-8186-3230-5
Type :
conf
DOI :
10.1109/HICSS.1993.270611
Filename :
270611
Link To Document :
بازگشت