مرکز منطقه ای اطلاع رساني علوم و فناوري - Parametrized Stochastic Grammars for RNA Secondary Structure Prediction

DocumentCode :

1865908

Title :

Parametrized Stochastic Grammars for RNA Secondary Structure Prediction

Author :

Maier, Robert S.

Author_Institution :

Univ. of Arizona, Tucson

fYear :

2007

fDate :

Jan. 29 2007-Feb. 2 2007

Firstpage :

256

Lastpage :

260

Abstract :

We propose a two-level stochastic context-free grammar (SCFG) architecture for parametrized stochastic modeling of a family of RNA sequences, including their secondary structure. A stochastic model of this type can be used for maximum a posteriori estimation of the secondary structure of any new sequence in the family. The proposed SCFG architecture models RNA subsequences comprising paired bases as stochastically weighted Dyck-language words, i.e., as weighted balanced- parenthesis expressions. The length of each run of unpaired bases, forming a loop or a bulge, is taken to have a phase-type distribution: that of the hitting time in a finite-state Markov chain. Without loss of generality, each such Markov chain can be taken to have a bounded complexity. The scheme yields an overall family SCFG with a manageable number of parameters.

Keywords :

Markov processes; biology computing; context-free grammars; macromolecules; maximum likelihood estimation; molecular biophysics; organic compounds; statistical distributions; Dyck-language words; RNA secondary structure; RNA sequence; finite-state Markov chain; maximum a posteriori estimation; parametrized stochastic modelling; phase-type distribution; probability distribution; stochastic context-free grammar; Biological system modeling; Context modeling; Hidden Markov models; Mathematical model; Parameter estimation; Predictive models; Probability distribution; RNA; Sequences; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Information Theory and Applications Workshop, 2007

Conference_Location :

La Jolla, CA

Print_ISBN :

978-0-615-15314-8

Type :

conf

DOI :

10.1109/ITA.2007.4357589

Filename :

4357589

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1865908