• DocumentCode
    1899974
  • Title

    Transcription Factor Discovery using Support Vector Machines and Heterogeneous Data

  • Author

    Barbe, J.F. ; Tewfik, Ahmed H. ; Khodursky, Arkady B.

  • Author_Institution
    Univ. of Minnesota, Minneapolis
  • fYear
    2007
  • fDate
    10-12 June 2007
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this work we analyze the suitability of expression and sequence data for discovery of co-regulatory relationships using Support Vector Machines. In addition, we try to assess the possibility of improving such results by heterogeneous data fusion and by estimating a probability of a correct classification. As shown in other studies, we have found that transcription co-expression is a good estimator for genetic co-regulation. We also have found some evidence that operator site sequence motifs can be used to estimate co-regulation, but the kernels used for feature extraction did not achieve classification rates comparable to expression data. Finally, the additional information provided by combining sequence and expression data can be exploited to estimate the probability of correct classification.
  • Keywords
    probability; support vector machines; coregulatory relationships; heterogeneous data; operator site sequence; probability estimation; sequence data; support vector machines; transcription factor discovery; DNA; Feature extraction; Hidden Markov models; Kernel; Polymers; Probability; RNA; Sequences; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics, 2007. GENSIPS 2007. IEEE International Workshop on
  • Conference_Location
    Tuusula
  • Print_ISBN
    978-1-4244-0998-3
  • Electronic_ISBN
    978-1-4244-0999-0
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2007.4365812
  • Filename
    4365812