Title :
Improved Algorithms for Parsing ESLTAGs: A Grammatical Model Suitable for RNA Pseudoknots
Author :
Rajasekaran, Sanguthevar ; Al Seesi, Sahar ; Ammar, Reda A.
Author_Institution :
Comput. Sci. & Eng. Dept., Univ. of Connecticut, Storrs, CT, USA
Abstract :
Formal grammars have been employed in biology to solve various important problems. In particular, grammars have been used to model and predict RNA structures. Two such grammars are Simple Linear Tree Adjoining Grammars (SLTAGs) and Extended SLTAGs (ESLTAGs). The performance of techniques that employ grammatical formalisms depends critically on the efficiency of the underlying parsing algorithms. In this paper, we present efficient algorithms for parsing SLTAGs and ESLTAGs. Our algorithm for SLTAG parsing takes O(min{m, n^4}) time and O(min{m, n^4}) space, where m is the number of entries that will ever be made in the matrix M (that is normally used by TAG parsing algorithms). Our algorithm for ESLTAG parsing takes O(n min{m, n^4}) time and O(min{m, n^4}) space. We show that these algorithms perform better, in practice, than the algorithms of Uemura et al.
Keywords :
biology computing; grammars; molecular biophysics; organic compounds; ESLTAG parsing; Extended SLTAG; RNA pseudoknots; RNA structures; Simple Linear Tree Adjoining Grammars; formal grammars; grammatical model; improved algorithms; Algorithm design and analysis; Bioinformatics; Biological system modeling; Dynamic programming; Genomics; Inference algorithms; Predictive models; RNA; Technical Activities Guide -TAG; Testing; RNA structure analysis; parsing algorithms; tree adjoining grammars; Algorithms; Base Sequence; Models, Molecular; Molecular Sequence Data; Nucleic Acid Conformation; RNA; Software;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2010.54