DocumentCode
2487837
Title
Gene function prediction using protein domain probability and hierarchical Gene Ontology information
Author
Jung, Jaehee ; Thon, Michael R.
Author_Institution
Dept. of Comput. Sci., Texas A&M Univ., College Station, TX
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
The Gene Ontology (GO) is a controlled vocabulary of terms to describe protein functions. It also includes a hierarchical description of the relationships among the terms in the form of a directed acyclic graph (DAG). Several systems have been developed that employ pattern recognition to assign gene function, using a variety of features, including sequence similarity, presence of protein functional domains and gene expression patterns, but most of these approaches have not considered the hierarchical structure of the GO. The DAG represents the functional relationships between the GO terms, thus it should be an important component of an automated annotation system. We propose a Bayesian, multi-label classifier that incorporates the relationships among GO terms found in the GO DAG. A comparative analysis of our method to other previously described annotation systems shows that our method provides improved annotation accuracy when the performance of individual GO terms are compared. More importantly, our method enables the classification of significantly more GO terms to more proteins than were previously possible.
Keywords
Bayes methods; biology computing; directed graphs; genetics; ontologies (artificial intelligence); pattern classification; proteins; Bayesian classifier; automated annotation system; directed acyclic graph; gene expression pattern; gene function prediction; hierarchical gene ontology information; multi-label classifier; pattern recognition; protein domain probability; protein functional domains; protein functions; sequence similarity; Automatic control; Bayesian methods; Bioinformatics; Gene expression; Genomics; Ontologies; Pediatrics; Probability; Protein engineering; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761737
Filename
4761737
Link To Document