Title :
Semantically improved genome-wide prediction of Gene Ontology annotations
Author :
Masseroli, Marco ; Tagliasacchi, Marco ; Chicco, Davide
Author_Institution :
Dipt. di Elettron. e Inf., Politec. di Milano, Milan, Italy
Abstract :
Genomic annotations, which describe structural and functional features of genes and gene products through controlled terminologies and ontologies, are extremely valuable, especially for computational analyses that rely on available annotations to infer new biomedical knowledge. Yet they are incomplete, especially for recently studied genomes, and only some of the available annotations represent highly reliable, human-curated information. To support and speed up the time-consuming curation process and improve available annotations, computational methods that provide prioritized lists of predicted annotations are paramount. Building on previous work on automatic prediction of Gene Ontology annotations based on singular value decomposition (SVD) of the gene-to-term annotation matrix, we propose a novel prediction algorithm that incorporates gene clustering based on gene functional similarity computed on Gene Ontology annotations. We tested both prediction methods by performing k-fold cross-validation on two organism genomes, Saccharomyces cerevisiae (SGD) and Drosophila melanogaster (FlyBase). Results demonstrate the effectiveness of our approach.
Keywords :
bioinformatics; genetics; genomics; ontologies (artificial intelligence); singular value decomposition; Drosophila melanogaster; FlyBase; SGD; SVD; Saccharomyces cerevisiae; automatic prediction; biomedical knowledge; computational analysis; functional feature; gene clustering; gene functional similarity; gene ontology annotation; gene products; gene to term annotation matrix; k-fold cross validation; organism genomes; semantically improved genome wide prediction; singular value decomposition; structural feature; terminologies; Correlation; Databases; Matrix decomposition; Measurement; Ontologies; Organisms; Prediction methods; Annotation prediction; Singular Value Decomposition; gene similarity metrics;
Conference_Title :
Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
Conference_Location :
Cordoba
Print_ISBN :
978-1-4577-1676-8
DOI :
10.1109/ISDA.2011.6121802