• DocumentCode
    667194
  • Title

    Modified free energy model to improve RNA secondary structure prediction with pseudoknots

  • Author

    Kwok-Kit Tong ; Kwan-Yau Cheung ; Kin-Hong Lee ; Kwong-Sak Leung

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, Shatin, China
  • fYear
    2013
  • fDate
    10-13 Nov. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The free energy (evaluation) models used in RNA secondary structure prediction are one of the most important reasons that makes the prediction a challenging computational problem in Bioinformatics. These models are the key factor determining the accuracy of the prediction algorithms. Previously we have developed a method called GAknot that has obtained good performance on predicting RNA secondary structures with pseudoknots. In this paper, we propose a new free energy model. We first select a number of RNA sequences from a database which contains known RNA secondary structures as a training dataset for learning this new model. From the training dataset, we then extract base pairs patterns in subsequences of pairs of k-mers from the stems of each sequence in the training data and use the patterns to formulate penalty factors. We modify the energy model by adding these penalty factors. Combined with the new modified energy model, the prediction performance of GAknot has been improved significantly. GAknot with the new modified energy model is shown to be the best method in comparison with two state-of-the-art algorithms using a commonly used testing dataset. The penalty factors of the new energy model and dataset can be downloaded at http://appsrv.cse.cuhk.edu.hk/~kktong/NewModel.
  • Keywords
    RNA; bioinformatics; free energy; molecular biophysics; molecular configurations; Bioinformatics; GAknot; RNA secondary structure prediction; modified free energy model; penalty factors; prediction algorithm; pseudoknots; Accuracy; Biological system modeling; Prediction algorithms; Predictive models; RNA; Training; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
  • Conference_Location
    Chania
  • Type

    conf

  • DOI
    10.1109/BIBE.2013.6701532
  • Filename
    6701532