DocumentCode :
1809163
Title :
The ruzzo-tompa algorithm can find the maximal paths in weighted, directed graphs on a one-dimensional lattice
Author :
Spouge, John L. ; Mariño-Ramírez, Leonardo ; Sheetlin, Sergey L.
Author_Institution :
Comput. Biol. Branch, Nat. Center for Biotechnol. Inf., Bethesda, MD, USA
fYear :
2012
fDate :
23-25 Feb. 2012
Firstpage :
1
Lastpage :
6
Abstract :
Biological sequences can contain regions of unusual composition, e.g., proteins contain DNA binding domains, transmembrane regions, and charged regions. The linear-time Ruzzo-Tompa algorithm finds such regions by inputting a sequence of scores and outputting the corresponding “maximal segments”, i.e., contiguous, disjoint subsequences having the greatest total scores. Just as gaps improved the sensitivity of BLAST searches, they might improve the sensitivity of searches for regions of unusual composition as well. Accordingly, we generalize the Ruzzo-Tompa algorithm from sequences of scores to paths in weighted, directed graphs on a one-dimensional lattice. Within the generalization, unfavorable scores can be deleted from contiguous, disjoint subsequences by paying a penalty, and the Ruzzo-Tompa algorithm can then find gapped subsequences having the greatest total gapped scores. An application to finding gapped inexact repeats in biological sequences exemplifies some of the concepts.
Keywords :
DNA; biochemistry; molecular biophysics; proteins; DNA binding domains; Ruzzo-Tompa algorithm; biological sequences; directed graphs; maximal paths; maximal segments; one-dimensional lattice; proteins; total gapped scores; transmembrane regions; weighted graphs; Algorithm design and analysis; Amino acids; Context; Lattices; Proteins; Sensitivity; gapped Ruzzo-Tompa algorithm; locally optimal subsequence; maximal subsegment; sequence analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2012 IEEE 2nd International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-1320-9
Electronic_ISBN :
978-1-4673-1319-3
Type :
conf
DOI :
10.1109/ICCABS.2012.6182645
Filename :
6182645
Link To Document :
بازگشت