DocumentCode :
2369304
Title :
Correlation between computed equilibrium secondary structure free energy and siRNA efficiency
Author :
Bhattacharjee, Puranjoy ; Ramakrishnan, Naren ; Heath, Lenwood S. ; Onufriev, Alexey V.
Author_Institution :
Dept. of Comput. Sci. & Applic., Virginia Tech, Blacksburg, VA, USA
fYear :
2009
fDate :
1-4 Nov. 2009
Firstpage :
358
Lastpage :
358
Abstract :
We have explored correlations between the measured efficiency of the RNAi process and several computed signatures that characterize equilibrium secondary structure of the participating mRNA, siRNA, and their complexes. A previously published data set of 609 experimental points (with efficiency represented as percentage of remaining mRNA) was used for the analysis. While virtually no correlation with the computed structural signatures are observed for individual data points, several clear trends emerge when the ldquonoiserdquo is reduced by averaging over 10 bins of N ~ 60 data points per bin. The strongest of the trends is a positive linear (r2 = 0.87) correlation between ln(remaining mRNA) and DeltaGms, the combined free energy cost of unraveling the siRNA and creating the break in the mRNA secondary structure at the complementary target strand region. At the same time, the free energy change DeltaGtotal of the entire process mRNA + siRNA rarr (mRNA-siRNA)Complex is not correlated with RNAi efficiency, even after the averaging. These general findings appear to be robust to details of the computational protocols, suggesting that, while straightforward analysis based on equilibrium secondary structure thermodynamics may not be directly applicable to the entire RNAi process, it is applicable to at least one of its key stages. The correlation between computed DeltaGms and experimentally observed RNAi efficiency can be used to enhance the ability of a machine learning algorithm based on a support vector machine (SVM) to predict effective siRNA sequences for a given target mRNA. Specifically, we observe modest, 3 to 7%, but consistent improvement in the positive predictive value (PPV) when the SVM training set is pre- or post-filtered to half the original size according to a DeltaGms threshold.
Keywords :
biology computing; computational complexity; learning (artificial intelligence); macromolecules; physics computing; support vector machines; RNAi process; computed equilibrium secondary structure free energy; computed structural signatures; mRNA secondary structure; machine learning algorithm; positive predictive value; siRNA efficiency; support vector machine; Application software; Computer science; Costs; Energy measurement; Noise reduction; Particle measurements; Physics computing; Robustness; Support vector machines; Thermodynamics; RNA interference (RNAi); RNA secondary structure; RNAi efficiency; RNAi equilibrium thermodynamics; Support Vector Machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-5121-0
Type :
conf
DOI :
10.1109/BIBMW.2009.5332077
Filename :
5332077
Link To Document :
بازگشت