• DocumentCode
    2227923
  • Title

    A sequential Monte Carlo method for motif discovery

  • Author

    Kuo-ching Liang ; Xiaodong Wang ; Anastassiou, Dimitris

  • Author_Institution
    Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
  • fYear
    2006
  • fDate
    4-8 Sept. 2006
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    We propose a sequential Monte Carlo (SMC)-based motif discovery algorithm that can efficiently detect motifs in datasets containing a large number of sequences. The statistical distribution of the motifs and the positions of the motifs within the sequences are estimated by the SMC algorithm. The proposed SMC motif discovery technique can locate motifs under a number of scenarios, including the single-block model, two-block model with unknown gap length, motifs of unknown lengths, motifs with unknown abundance, and sequences with multiple unique motifs. The accuracy of the SMC motif discovery algorithm is shown to be superior to that of the existing methods based on MCMC or EM algorithms. Furthermore, it is shown that the proposed method can be used to improve the results of existing motif discovery algorithms by using their results as the priors for the SMC algorithm.
  • Keywords
    DNA; Markov processes; Monte Carlo methods; biological techniques; genomics; statistical distributions; EM algorithm; MCMC algorithm; SMC algorithm; motif discovery algorithm; sequential Monte Carlo method; single-block model; statistical distribution; two-block model; unknown gap length; Abstracts; Bioinformatics; Genomics; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2006 14th European
  • Conference_Location
    Florence
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7071752