Title :
R-PASS: A Fast Structure-Based RNA Sequence Alignment Algorithm
Author :
Jiang, Yanan ; Xu, Weijia ; Thompson, Lee Parnell ; Gutell, Robin R. ; Miranker, Daniel P.
Author_Institution :
Inst. for Cellular & Mol. Biol., Univ. of Texas at Austin, Austin, TX, USA
Abstract :
We present R-PASS (RNA Pairwise Alignment of Structure and Sequence), a fast pairwise RNA sequence alignment method that uses structural information. It shows good accuracy on sequences with low sequence identity and is significantly faster than alternative methods. The method begins by representing RNA secondary structure as a set of structure motifs. The motifs from the two RNAs are then used as input to a bipartite graph-matching algorithm, which determines the structure matches. These matches then serve as constraints in a constrained dynamic programming sequence alignment procedure. The R-PASS method has O(nm) complexity. We compare our method with two other structure-based alignment methods, LARA and ExpaLoc, and with a sequence-based alignment method, MAFFT, across three benchmarks, and obtain favorable accuracy with running times that are orders of magnitude faster.
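The constrained dynamic programming step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the structure matches have already been reduced to a list of anchor position pairs, and it uses a plain Needleman-Wunsch alignment with hypothetical unit scores (match 1, mismatch -1, gap -1) between consecutive anchors. The function names `needleman_wunsch` and `anchored_align` are illustrative only.

```python
from typing import List, Tuple


def needleman_wunsch(a: str, b: str, match: int = 1,
                     mismatch: int = -1, gap: int = -1) -> Tuple[str, str]:
    """Global alignment of a and b with linear gap costs (assumed scores)."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def anchored_align(a: str, b: str,
                   anchors: List[Tuple[int, int]]) -> Tuple[str, str]:
    """Align a and b so that each 0-based anchor (i, j) pairs a[i] with b[j].

    Anchors stand in for structure matches (an assumption of this sketch);
    the segments between consecutive anchors are aligned independently.
    """
    out_a, out_b = [], []
    prev_i, prev_j = 0, 0
    for i, j in sorted(anchors):
        if i < prev_i or j < prev_j:
            raise ValueError("anchors must be strictly increasing in both sequences")
        seg_a, seg_b = needleman_wunsch(a[prev_i:i], b[prev_j:j])
        out_a += [seg_a, a[i]]
        out_b += [seg_b, b[j]]
        prev_i, prev_j = i + 1, j + 1
    seg_a, seg_b = needleman_wunsch(a[prev_i:], b[prev_j:])
    out_a.append(seg_a)
    out_b.append(seg_b)
    return "".join(out_a), "".join(out_b)


row_a, row_b = anchored_align("GGGAAACCC", "GGAAAACC", [(0, 0), (8, 7)])
print(row_a)
print(row_b)
```

Because each segment between anchors is filled in separately, the total dynamic programming work in this sketch is bounded by the product of the two sequence lengths, and it shrinks as more anchors are supplied.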
Keywords :
RNA; biology; computational complexity; dynamic programming; R-PASS method; RNA secondary; complexity; constrained dynamic programming sequence alignment; fast pairwise RNA sequence alignment; fast structure-based RNA sequence alignment algorithm; pairwise alignment of structure and sequence; sequence identity; structural information; structure-based alignment; Accuracy; Bioinformatics; Bipartite graph; Complexity theory; Compounds; Vectors; RNA pairwise structural alignment; bipartite graph matching; constraint sequence alignment; structure motif
Conference_Title :
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1799-4
DOI :
10.1109/BIBM.2011.74