• DocumentCode
    952248
  • Title

    Fast and Practical Algorithms for Planted (l, d) Motif Search

  • Author

    Davila, Jaime ; Balla, Sudha ; Rajasekaran, Sanguthevar

  • Author_Institution
    Univ. of Connecticut, Storrs
  • Volume
    4
  • Issue
    4
  • fYear
    2007
  • Firstpage
    544
  • Lastpage
    552
  • Abstract
    We consider the planted (l, d) motif search problem, which consists of finding a substring of length l that occurs in a set of input sequences {s1, ..., sn} with up to d errors, a problem that arises from the need to find transcription factor-binding sites in genomic information. We propose a sequence of practical algorithms, which start based on the ideas considered in PMS1. These algorithms are exact, have little space requirements, and are able to tackle challenging instances with bigger d, taking less time in the instances reported solved by exact algorithms. In particular, one of the proposed algorithms, PMSprune, is able to solve the challenging instances, such as (17, 6) and (19, 7), which were not previously reported as solved in the literature.
  • Keywords
    biology computing; genetics; tree searching; branch and bound algorithms; genomic information; input sequences; planted (l, d) motif search; transcription factor-binding sites; Planted motif search problem; challenging instances; exact algorithms; Algorithms; Amino Acid Motifs; Binding Sites; Computational Biology; Models, Statistical; Models, Theoretical; Pattern Recognition, Automated; Protein Binding; Sequence Alignment; Transcription Factors
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2007.70241
  • Filename
    4359890
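
The problem statement in the abstract can be illustrated with a minimal brute-force sketch. This is not PMS1 or PMSprune, whose details are not given in this record; it only enumerates every candidate string of length l and keeps those that occur in each input sequence with at most d mismatches. The function names, the DNA alphabet default, and the use of Hamming distance as the error measure are assumptions made for illustration.

```python
from itertools import product


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def occurs_within(motif, seq, d):
    """True if some window of seq of length len(motif) is within d mismatches of motif."""
    l = len(motif)
    return any(hamming(motif, seq[i:i + l]) <= d
               for i in range(len(seq) - l + 1))


def planted_motifs(seqs, l, d, alphabet="ACGT"):
    """Exhaustively test all len(alphabet)**l candidates (illustrative only).

    Returns every candidate of length l that occurs in each sequence
    with at most d mismatches, in lexicographic order of the alphabet.
    """
    found = []
    for letters in product(alphabet, repeat=l):
        motif = "".join(letters)
        if all(occurs_within(motif, s, d) for s in seqs):
            found.append(motif)
    return found


if __name__ == "__main__":
    # Toy input chosen for illustration; not data from the paper.
    seqs = ["ACGTAC", "TTACGA", "GGACGT"]
    print(planted_motifs(seqs, l=3, d=0))
```

The number of candidates grows as 4^l for DNA, so a sketch of this kind is only usable for very small l; instances such as (17, 6) and (19, 7) are far beyond its reach, which is why exact algorithms with pruning are of interest.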