DocumentCode :
3074132
Title :
Asymptotics of Canonical RNA Secondary Structures
Author :
Clote, Peter ; Kranakis, Evangelos ; Krizanc, Danny
Author_Institution :
Biol. Dept., Boston Coll., Chestnut Hill, MA, USA
fYear :
2009
fDate :
22-24 June 2009
Firstpage :
184
Lastpage :
191
Abstract :
It is a classical result of Stein and Waterman that the asymptotic number S(n) of RNA secondary structures is 1.104366 ldr n-3/2 ldr 2.618034n, where the combinatorial model of RNA concerns a length n homopolymer, such that any base can pair with any other base, subject to the usual convention that hairpin loops must contain at least thetas = 1 unpaired bases. The result of Stein and Waterman is proved by developing recursions,using generating functions and applying Bender´s theorem. These recursions form the basis to compute the minimum free energy secondary structure for a given RNA sequence, with respect to the Nussinov energy model, later extended by Zuker to substantially more complicated resursions for the Turner nearest neighbor energy model. In this paper, we study combinatorial asymptotic for two special subclasses of RNA secondary structures - canonical and saturated structures. Canonical secondary structures are defined to have no lonely (isolated) base pairs. This class of secondary structures was introduced b y Bompfuenewerer et al., who noted that the runtime of Vienna RNA Package is substantially decreased when restricting computations to canonical structures. Here we provide an explanation for the speed-up, by proving that the asymptotic number of canonical RNA secondary structures is 2.1614 ldr n-3/2 ldr 1.96798n. Saturated secondary structures have the property that no base pairs can be added without violating the definition of secondary structure (i.e. introducing a pseudoknotor base triple). In the Nussinov energy model,where the energy for a base pair is -1, saturated structures correspond to kinetic traps.n prior work, we showed that the asymptotic number of saturated structures of a length n homopolymer is 1.07427 ldr n-3/2 ldr 2.35467n. In this paper, we show that the expected number of base pairs of random saturated structures, generated by a natural stochastic procedure, is - (zthetas+1)/((1-z)2) (-z-Sigmai=0 thetas (z2)/(i+1)) (int e (z+Sigmai=0 thetas (z2)/(i+1))dz).
Keywords :
macromolecules; molecular biophysics; molecular configurations; organic compounds; polymers; stochastic processes; Nussinov energy model; Vienna RNA Package; canonical RNA secondary structures; combinatorial asymptotics; homopolymer; kinetic traps; natural stochastic procedure; saturated structure; Amino acids; Bioinformatics; Biomedical engineering; Computational biology; Computer science; Educational institutions; Mathematical model; Mathematics; RNA; USA Councils; RNA; asymptotic combinatorics; secondary structure;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
Conference_Location :
Taichung
Print_ISBN :
978-0-7695-3656-9
Type :
conf
DOI :
10.1109/BIBE.2009.76
Filename :
5211290
Link To Document :
بازگشت