• DocumentCode
    2844974
  • Title

    Comparison of four similarity measures based on GO annotations for Gene Clustering

  • Author

    Al Mubaid, H. ; Nagar, Anurag

  • Author_Institution
    Univ. of Houston-Clear Lake, Houston, TX
  • fYear
    2008
  • fDate
    6-9 July 2008
  • Firstpage
    531
  • Lastpage
    536
  • Abstract
    Gene ontology (GO) has fast become a dependable source for determining gene functions, gene similarity, and gene clustering. Furthermore, using GO and gene annotation databases, with semantic similarity measures, is now more acceptable in bioinformatics as means for gene functional analysis. In this paper, we compare four semantic similarity measures to compute the similarity between genes using GO annotations within a gene clustering application. The similarity measures we examined in this study rely on different information sources and techniques in computing the similarity between genes. For example, we selected two measures that are based on information-content, one measure is based on ontology-structure, and the fourth measure is based on annotation terms within the GO hierarchy. In the evaluation, we used five gene datasets from yeast genome, and we analyzed the results based on various clustering metrics and criteria.
  • Keywords
    biology computing; database management systems; genetics; ontologies (artificial intelligence); pattern clustering; bioinformatics; clustering metrics; gene annotation databases; gene clustering; gene functional analysis; gene ontology annotations; gene similarity; semantic similarity measures; yeast genome; Bioinformatics; Databases; Functional analysis; Fungi; Genomics; Lakes; Length measurement; Ontologies; Proteins; Time measurement; GO clustering; Gene clustering; gene similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computers and Communications, 2008. ISCC 2008. IEEE Symposium on
  • Conference_Location
    Marrakech
  • ISSN
    1530-1346
  • Print_ISBN
    978-1-4244-2702-4
  • Electronic_ISBN
    1530-1346
  • Type

    conf

  • DOI
    10.1109/ISCC.2008.4625763
  • Filename
    4625763