• DocumentCode
    419328
  • Title

    MISAE: a new approach for regulatory motif extraction

  • Author

    Sun, Zhaohui ; Yang, Jingyi ; Deogun, Jitender S.

  • Author_Institution
    Nebraska Univ., Lincoln, NE, USA
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    173
  • Lastpage
    181
  • Abstract
    The recognition of regulatory motifs of co-regulated genes is essential for understanding the regulatory mechanisms. However, the automatic extraction of regulatory motifs from a given data set of the upstream noncoding DNA sequences of a family of co-regulated genes is difficult because regulatory motifs are often subtle and inexact. This problem is further complicated by the corruption of the data sets. In this paper, a new approach called mismatch-allowed probabilistic suffix tree motif extraction (MISAE) is proposed. It combines the mismatch-allowed probabilistic suffix tree that is a probabilistic model and local prediction for the extraction of regulatory motifs. The proposed approach is tested on 15 co-regulated gene families and compares favorably with other state-of-the-art approaches. Moreover, MISAE performs well on "corrupted" data sets. It is able to extract the motif from a "corrupted" data set with less than one fourth of the sequences containing the real motif.
  • Keywords
    DNA; biology computing; genetics; molecular biophysics; trees (mathematics); MISAE; coregulated genes; mismatch-allowed probabilistic suffix tree motif extraction; regulatory motif extraction; upstream noncoding DNA sequences; Bioinformatics; Computer science; DNA; Data mining; Gene expression; Genomics; Predictive models; Sequences; Sun; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332430
  • Filename
    1332430