Title :
Learning relationships between over-represented motifs in a set of DNA sequences
Author :
Korol, Oksana ; Turcotte, Marcel
Author_Institution :
Sch. of Electr. Eng. & Comput. Sci., Univ. of Ottawa, Ottawa, ON, Canada
Abstract :
Finding relationships between DNA sequence motifs, such as transcription factor binding sites, is an important step to understand transcription regulation in a particular context. Current computational tools are not well adapted for discovering relationships. We have developed a software system, ModuleInducer, which integrates motif finding with the analysis of possible interactions between them in the set of related DNA sequences using inductive logic programming. Our method was tested on synthetic and two kinds of real biological data. It has been shown to perform well as a cis-regulatory module finder as well as a knowledge mining tool for ChIP-Sequencing data analysis. Our method has proven to be of high suggestive value for future research by uncovering novel motif interactions in ChIP-Seq data, missed in the original study. ModuleInducer is available at: http://induce.eecs.uottawa.ca.
Keywords :
DNA; bioinformatics; biological techniques; data mining; inductive logic programming; learning (artificial intelligence); molecular biophysics; molecular configurations; ChIP-Seq data; ChIP-Sequencing data analysis; DNA sequence relationship learning; ModuleInducer software system; cis-regulatory module finder; inductive logic programming; knowledge mining tool; motif finding; motif interactions; over-represented DNA sequence motifs; related DNA sequences; transcription factor binding sites; transcription regulation; Biomarkers; Context; DNA; Educational institutions; Engines; User interfaces; binding sites; inductive logic programming; relationships; spatial motifs; transcription factors/metabolism; transcriptional regulation; transcriptional regulatory elements;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2012 IEEE Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-1190-8
DOI :
10.1109/CIBCB.2012.6217223