DocumentCode
3060874
Title
SVMotif: A Machine Learning Motif Algorithm
Author
Kon, Mark ; Fan, Yue ; Holloway, Dustin ; DeLisi, Charles
Author_Institution
Boston Univ., Boston
fYear
2007
fDate
13-15 Dec. 2007
Firstpage
573
Lastpage
580
Abstract
We describe SVMotif, a support vector machine-based learning algorithm for identification of cellular DNA transcription factor (TF) motifs extrapolated from known TF-gene interactions. An important aspect of this procedure is its ability to utilize negative target information (examples of likely non-targets) as well as positive information. Applications involve situations where clusters of genes are distinguished in experiments with known transcription factors without known binding locations. We apply this to yeast TF data with target identifications from ChlP-chip and other sources, and compare performance with Gibbs sampling methods such as BioProspector. We verify that in yeast this method implies well-defined and cross-validated statistical correlations between TF binding and secondary motifs whose binding properties (either with the primary TF or other possible promoters) are not certain, and discuss some implications of this. SVMotif can be a useful standalone method or a complement to existing techniques, and it will be made publicly available.
Keywords
DNA; biology computing; cellular biophysics; correlation methods; learning (artificial intelligence); sampling methods; statistical analysis; support vector machines; BioProspector; ChlP-chip; Gibbs sampling methods; SVMotif; cellular DNA transcription factor motifs; machine learning motif algorithm; negative target information; statistical correlations; support vector machine; Bioinformatics; DNA; Fungi; Gene expression; Learning systems; Machine learning; Machine learning algorithms; Proteins; Regulators; Sampling methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location
Cincinnati, OH
Print_ISBN
978-0-7695-3069-7
Type
conf
DOI
10.1109/ICMLA.2007.105
Filename
4457291
Link To Document