DocumentCode :
2844997
Title :
Using path length measure for gene clustering based on similarity of annotation terms
Author :
Nagar, Anurag ; Al-Mubaid, Hisham
Author_Institution :
Univ. of Houston-Clear Lake, Houston, TX
fYear :
2008
fDate :
6-9 July 2008
Firstpage :
637
Lastpage :
642
Abstract :
The application of semantic similarity measures to gene data, using Gene Ontology (GO) and gene annotation information, has become more widely used and accepted in bioinformatics in recent years. Such applications range from measuring gene similarity to gene clustering. In this paper, we investigate a simple measure of gene similarity that relies on the path length between the GO annotation terms of genes to determine the similarity between them. The similarity values computed by the proposed measure for a set of genes are then used to cluster the genes. In the evaluation, we compared the proposed measure with two widely used information-theoretic similarity measures, Resnik and Lin, using three gene datasets. The experimental results and cluster analysis validated the effectiveness of the proposed path length measure.
Keywords :
biology computing; data analysis; genetics; information theory; pattern clustering; annotation terms similarity; bioinformatics; gene annotation information; gene clustering; gene data; gene ontology; information-theoretic similarity measures; path length measure; semantic similarity measures; Bioinformatics; Biology computing; Biomedical measurements; Clustering algorithms; Clustering methods; Lakes; Length measurement; Ontologies; Proteins; Time measurement; Gene clustering; gene similarity;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computers and Communications, 2008. ISCC 2008. IEEE Symposium on
Conference_Location :
Marrakech
ISSN :
1530-1346
Print_ISBN :
978-1-4244-2702-4
Electronic_ISBN :
1530-1346
Type :
conf
DOI :
10.1109/ISCC.2008.4625765
Filename :
4625765