DocumentCode :
112059
Title :
RGFinder: A System for Determining Semantically Related Genes Using GO Graph Minimum Spanning Tree
Author :
Taha, Kamal
Author_Institution :
Dept. of Electr. & Comput. Eng., Khalifa Univ., Abu Dhabi, United Arab Emirates
Volume :
14
Issue :
1
fYear :
2015
fDate :
Jan. 2015
Firstpage :
24
Lastpage :
37
Abstract :
Biologists often need to know the set S´ of genes that are the most functionally and semantically related to a given set S of genes. For determining the set S´, most current gene similarity measures overlook the structural dependencies among the Gene Ontology (GO) terms annotating the set S, which may lead to erroneous results. We introduce in this paper a biological search engine called RGFinder that considers the structural dependencies among GO terms by employing the concept of existence dependency. RGFinder assigns a weight to each edge in GO graph to represent the degree of relatedness between the two GO terms connected by the edge. The value of the weight is determined based on the following factors: 1) type of the relation represented by the edge (e.g., an “is-a” relation is assigned a different weight than a “part-of” relation), 2) the functional relationship between the two GO terms connected by the edge, and 3) the string-substring relationship between the names of the two GO terms connected by the edge. RGFinder then constructs a minimum spanning tree of GO graph based on these weights. In the framework of RGFinder, the set S´ is annotated to the GO terms located at the lowest convergences of the subtree of the minimum spanning tree that passes through the GO terms annotating set S. We evaluated RGFinder experimentally and compared it with four gene set enrichment systems. Results showed marked improvement.
Keywords :
biology computing; ontologies (artificial intelligence); search engines; trees (mathematics); GO graph minimum spanning tree; RGFinder system; biological search engine; gene ontology; gene set enrichment system; semantically related genes; string-substring relationship; Current measurement; Joining processes; Nanobioscience; Semantics; Sociology; Statistics; Biological search engine; GO term; Gene Ontology; related genes; related terms; semantic similarity;
fLanguage :
English
Journal_Title :
NanoBioscience, IEEE Transactions on
Publisher :
ieee
ISSN :
1536-1241
Type :
jour
DOI :
10.1109/TNB.2014.2363295
Filename :
6926848
Link To Document :
بازگشت