DocumentCode :
3410540
Title :
A hidden Markov model for gene function prediction from sequential expression data
Author :
Deng, Xutao ; Ali, Hesham
Author_Institution :
Coll. of Inf. Sci. & Technol., Nebraska Univ., Omaha, NE, USA
fYear :
2004
fDate :
16-19 Aug. 2004
Firstpage :
670
Lastpage :
671
Abstract :
Hidden Markov models (HMMs) have demonstrated great successes in modeling noisy sequential data sets in the area of speech recognition and protein sequence profiling. Results from association test showed significant Markov dependency in time-series gene expression data, and therefore HMMs would be especially appropriate for modeling gene expressions. In this project, we developed a gene function prediction tool based on profile HMMs. Each function class is associated with a distinct HMM whose parameters are trained using yeast time-series gene expression data. The function annotations of the HMM training set were obtained from Munich Information Centre for Protein Sequences (MIPS) data base. We designed several structural variants of HMMs (single, double-split) and tested each of them on forty function classes each of which includes more than one hundred instances. The highest prediction sensitivity we achieved is 51% by using double-split HMM with 3-fold cross-validation.
Keywords :
biology computing; genetics; hidden Markov models; molecular biophysics; prediction theory; proteins; time series; Munich Information Centre for Protein Sequences data base; function annotations; gene function prediction; hidden Markov model; sequential expression data; yeast time-series gene expression data; Accuracy; Bayesian methods; Bioinformatics; Educational institutions; Fungi; Gene expression; Genomics; Hidden Markov models; Information science; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
Type :
conf
DOI :
10.1109/CSB.2004.1332541
Filename :
1332541
Link To Document :
بازگشت