• DocumentCode
    2770896
  • Title

    Discovering DNA Motifs with Nucleotide Dependency

  • Author

    Leung, Henry C M ; Chin, Francis Y L

  • Author_Institution
    Dept. of Comput. Sci., Hong Kong Univ.
  • fYear
    2006
  • fDate
    16-18 Oct. 2006
  • Firstpage
    70
  • Lastpage
    80
  • Abstract
    The problem of finding motifs of binding sites is very important to the understanding of gene regulatory networks. Motifs are generally represented by matrices (PWM or PSSM) or strings. However, these representations cannot model biological binding sites well because they fail to capture nucleotide interdependence. It has been pointed out by many researchers that the nucleotides of the DNA binding site cannot be treated independently, e.g. the binding of zinc finger in proteins. In this paper, a new representation called Scored Position Specific Pattern (SPSP), which is a generalization of the matrix and string representations, is introduced which takes into consideration the dependent occurrences of neighboring nucleotides. Even though the problem of finding the optimal motif in SPSP representation is proved to be NP-hard, we introduce a heuristic algorithm called SPSP-Finder, which can effectively find optimal motifs in most simulated cases and some real cases for which existing popular motif-finding software, such as MEME and AlignACE fail
  • Keywords
    DNA; biochemistry; biology computing; computational complexity; genetics; matrix algebra; molecular biophysics; optimisation; proteins; DNA binding site; NP-hard problem; biological binding sites; gene regulatory networks; heuristic algorithm; matrix generalization; motif-finding software; nucleotide; position specific scoring matrices; position weight matrices; proteins; regulatory DNA sequence motifs discovery; scored position specific pattern-finder; string representation; zinc finger; Biological system modeling; Computer science; DNA; Fingers; Heuristic algorithms; Hidden Markov models; Proteins; Pulse width modulation; Software algorithms; Zinc;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on
  • Conference_Location
    Arlington, VA
  • Print_ISBN
    0-7695-2727-2
  • Type

    conf

  • DOI
    10.1109/BIBE.2006.253318
  • Filename
    4019643