DocumentCode :
1047351
Title :
Management and Analysis of Genomic Functional and Phenotypic Controlled Annotations to Support Biomedical Investigation and Practice
Author :
Masseroli, Marco
Author_Institution :
Politecnico di Milano, Milan
Volume :
11
Issue :
4
fYear :
2007
fDate :
7/1/2007 12:00:00 AM
Firstpage :
376
Lastpage :
385
Abstract :
The growing amount of available genomic information provides new opportunities for novel research approaches and original biomedical applications that can provide effective data management and analysis support. In fact, integration and comprehensive evaluation of available controlled data can highlight information patterns that lead to unveiling new biomedical knowledge. Here, we describe Genome Function INtegrated Discoverer (GFINDer), a Web-accessible three-tier multidatabase system we developed to automatically enrich lists of user-classified genes with several functional and phenotypic controlled annotations, and to statistically evaluate them in order to identify annotation categories significantly over- or underrepresented in each considered gene class. Genomic controlled annotations from Gene Ontology (GO), KEGG, Pfam, InterPro, and Online Mendelian Inheritance in Man (OMIM) were integrated in GFINDer and several categorical tests were implemented for their analysis. A controlled vocabulary of inherited disorder phenotypes was obtained by normalizing and hierarchically structuring disease-accompanying signs and symptoms from OMIM Clinical Synopsis sections. GFINDer's modular architecture is well suited for further system expansion and for sustaining increasing workload. Testing results showed that GFINDer analyses can highlight gene functional and phenotypic characteristics and differences, demonstrating its value in supporting genomic biomedical approaches aiming at understanding the complex biomolecular mechanisms underlying patho-physiological phenotypes, and in helping the transfer of genomic results to medical practice.
Keywords :
Internet; biomedical engineering; data mining; database management systems; genetic engineering; genetics; medical administrative data processing; ontologies (artificial intelligence); statistical analysis; GFINDer; Gene Ontology; Genome Function INtegrated Discover; InterPro; KEGG; OMIM; OMIM Clinical Synopsis; Online Mendelian Inheritance in Man; Pfam; Web-accessible three-tier multidatabase system; biomedical applications; biomedical databases; biomedical ontologies; data mining; disease; disorder phenotypes; genomic data management; genomic databases; genomic functional annotations; knowledge discovery; modular architecture; patho-physiological phenotypes; phenotypic controlled annotations; statistical analysis; user-classified genes; Automatic control; Bioinformatics; Control systems; Data analysis; Diseases; Genomics; Information analysis; Ontologies; Testing; Vocabulary; Biomedical ontologies; data mining and knowledge discovery; genomic and biomedical databases; genomic data management and statistical analysis; Computational Biology; Database Management Systems; Databases, Genetic; Genomics; Information Storage and Retrieval; Natural Language Processing; Phenotype; Research; User-Computer Interface;
fLanguage :
English
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
Publisher :
IEEE
ISSN :
1089-7771
Type :
jour
DOI :
10.1109/TITB.2006.884367
Filename :
4267689