DocumentCode
1899974
Title
Transcription Factor Discovery using Support Vector Machines and Heterogeneous Data
Author
Barbe, J.F. ; Tewfik, Ahmed H. ; Khodursky, Arkady B.
Author_Institution
Univ. of Minnesota, Minneapolis
fYear
2007
fDate
10-12 June 2007
Firstpage
1
Lastpage
4
Abstract
In this work we analyze the suitability of expression and sequence data for discovery of co-regulatory relationships using Support Vector Machines. In addition, we try to assess the possibility of improving such results by heterogeneous data fusion and by estimating a probability of a correct classification. As shown in other studies, we have found that transcription co-expression is a good estimator for genetic co-regulation. We also have found some evidence that operator site sequence motifs can be used to estimate co-regulation, but the kernels used for feature extraction did not achieve classification rates comparable to expression data. Finally, the additional information provided by combining sequence and expression data can be exploited to estimate the probability of correct classification.
Keywords
probability; support vector machines; coregulatory relationships; heterogeneous data; operator site sequence; probability estimation; sequence data; support vector machines; transcription factor discovery; DNA; Feature extraction; Hidden Markov models; Kernel; Polymers; Probability; RNA; Sequences; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Genomic Signal Processing and Statistics, 2007. GENSIPS 2007. IEEE International Workshop on
Conference_Location
Tuusula
Print_ISBN
978-1-4244-0998-3
Electronic_ISBN
978-1-4244-0999-0
Type
conf
DOI
10.1109/GENSIPS.2007.4365812
Filename
4365812
Link To Document