DocumentCode
3622095
Title
Mining Plausible Patterns from Genomic Data
Author
J. Klema;A. Soulet;B. Cremilleux;S. Blachon;O. Gandrillon
Author_Institution
Université
fYear
2006
fDate
6/28/1905 12:00:00 AM
Firstpage
183
Lastpage
190
Abstract
The discovery of biologically interpretable knowledge from gene expression data is one of the largest contemporary genomic challenges. As large volumes of expression data are being generated, there is a great need for automated tools that provide the means to analyze them. However, the same tools can provide an overwhelming number of candidate hypotheses which can hardly be manually exploited by an expert. An additional knowledge helping to focus automatically on the most plausible candidates only can up-value the experiment significantly. Background knowledge available in literature databases, biological ontologies and other sources can be used for this purpose. In this paper we propose and verify a methodology that enables to effectively mine and represent meaningful over-expression patterns. Each pattern represents a bi-set of a gene group over-expressed in a set of biological situations. The originality of the framework consists in its constraint-based nature and an effective cross-fertilization of constraints based on expression data and background knowledge. The result is a limited set of candidate patterns that are most likely interpretable by biologists. Supplemental automatic interpretations serve to ease this process. Various constraints can generate plausible pattern sets of different characteristics
Keywords
"Genomics","Bioinformatics","Biological information theory","Gene expression","Frequency","Data mining","Databases","Ontologies","Character generation","Pattern analysis"
Publisher
ieee
Conference_Titel
Computer-Based Medical Systems, 2006. CBMS 2006. 19th IEEE International Symposium on
ISSN
1063-7125
Print_ISBN
0-7695-2517-1
Type
conf
DOI
10.1109/CBMS.2006.116
Filename
1647566
Link To Document