• DocumentCode
    3060874
  • Title

    SVMotif: A Machine Learning Motif Algorithm

  • Author

    Kon, Mark ; Fan, Yue ; Holloway, Dustin ; DeLisi, Charles

  • Author_Institution
    Boston Univ., Boston
  • fYear
    2007
  • fDate
    13-15 Dec. 2007
  • Firstpage
    573
  • Lastpage
    580
  • Abstract
    We describe SVMotif, a support vector machine-based learning algorithm for identification of cellular DNA transcription factor (TF) motifs extrapolated from known TF-gene interactions. An important aspect of this procedure is its ability to utilize negative target information (examples of likely non-targets) as well as positive information. Applications involve situations where clusters of genes are distinguished in experiments with known transcription factors without known binding locations. We apply this to yeast TF data with target identifications from ChlP-chip and other sources, and compare performance with Gibbs sampling methods such as BioProspector. We verify that in yeast this method implies well-defined and cross-validated statistical correlations between TF binding and secondary motifs whose binding properties (either with the primary TF or other possible promoters) are not certain, and discuss some implications of this. SVMotif can be a useful standalone method or a complement to existing techniques, and it will be made publicly available.
  • Keywords
    DNA; biology computing; cellular biophysics; correlation methods; learning (artificial intelligence); sampling methods; statistical analysis; support vector machines; BioProspector; ChlP-chip; Gibbs sampling methods; SVMotif; cellular DNA transcription factor motifs; machine learning motif algorithm; negative target information; statistical correlations; support vector machine; Bioinformatics; DNA; Fungi; Gene expression; Learning systems; Machine learning; Machine learning algorithms; Proteins; Regulators; Sampling methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
  • Conference_Location
    Cincinnati, OH
  • Print_ISBN
    978-0-7695-3069-7
  • Type

    conf

  • DOI
    10.1109/ICMLA.2007.105
  • Filename
    4457291