DocumentCode :
1987079
Title :
LOGOS: a modular Bayesian model for de novo motif detection
Author :
Xing, Eric P. ; Wu, Wei ; Jordan, Michael I. ; Karp, Richard M.
Author_Institution :
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
fYear :
2003
fDate :
11-14 Aug. 2003
Firstpage :
266
Lastpage :
276
Abstract :
The complexity of the global organization and internal structures of motifs in higher eukaryotic organisms raises significant challenges for motif detection techniques. To achieve successful de novo motif detection it is necessary to model the complex dependencies within and among motifs and incorporate biological prior knowledge. In this paper, we present LOGOS, an integrated LOcal and GlObal motif Sequence model for biopolymer sequences, which provides a principled framework for developing, modularizing, extending and computing expressive motif models for complex biopolymer sequence analysis. LOGOS consists of two interacting submodels: HMDM, a local alignment model capturing biological prior knowledge and positional dependence within the motif local structure; and HMM, a global motif distribution model modeling frequencies and dependencies of motif occurrences. Model parameters can be fit using training motifs within an empirical Bayesian framework. A variational EM algorithm is developed for de novo motif detection. LOGOS improves over existing models that ignore biological priors and dependencies in motif structures and motif occurrences, and demonstrates superior performance on both semirealistic test data and cis-regulatory sequences from yeast and Drosophila sequences with regard to sensitivity, specificity, flexibility and extensibility.
Keywords :
Bayes methods; biology computing; cellular biophysics; genetic algorithms; genetics; hidden Markov models; microorganisms; molecular biophysics; physiological models; polymers; Drosophila sequences; GlObal motif Sequence model; HMDM; HMM; LOGOS; LOcal motif Sequence model; biological prior knowledge; cis-regulatory sequences; complex biopolymer sequence analysis; de novo motif detection technique; empirical Bayesian framework; expressive motif models computing; global motif distribution model; higher eukaryotic organisms; local alignment model; model parameters; modeling frequencies; modular Bayesian model; motif occurrences dependencies; motifs internal structures; semirealistic test data; training motifs; variational EM algorithm; yeast; Bayesian methods; Bioinformatics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
Type :
conf
DOI :
10.1109/CSB.2003.1227327
Filename :
1227327
Link To Document :
بازگشت