DocumentCode
952248
Title
Fast and Practical Algorithms for Planted (l, d) Motif Search
Author
Davila, Jaime ; Balla, Sudha ; Rajasekaran, Sanguthevar
Author_Institution
Univ. of Connecticut, Storrs
Volume
4
Issue
4
fYear
2007
Firstpage
544
Lastpage
552
Abstract
We consider the planted (l, d) motif search problem, which consists of finding a substring of length l that occurs in a set of input sequences {s1, ..., sn} with up to d errors, a problem that arises from the need to find transcription factor-binding sites in genomic information. We propose a sequence of practical algorithms, which start based on the ideas considered in PMS1. These algorithms are exact, have little space requirements, and are able to tackle challenging instances with bigger d, taking less time in the instances reported solved by exact algorithms. In particular, one of the proposed algorithms, PMSprune, is able to solve the challenging instances, such as (17, 6) and (19, 7), which were not previously reported as solved in the literature.
Keywords
biology computing; genetics; tree searching; branch and bound algorithms; genomic information; input sequences; planted (l, d) motif search; transcription factor-binding sites; Planted motif search problem; challenging instances; exact algorithms; Algorithms; Amino Acid Motifs; Binding Sites; Computational Biology; Models, Statistical; Models, Theoretical; Pattern Recognition, Automated; Protein Binding; Sequence Alignment; Transcription Factors;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
IEEE
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2007.70241
Filename
4359890