• DocumentCode
    4742
  • Title

    Determining Semantically Related Significant Genes

  • Author

    Taha, Kamal

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Khalifa Univ., Abu Dhabi, United Arab Emirates
  • Volume
    11
  • Issue
    6
  • fYear
    2014
  • fDate
    Nov.-Dec. 1 2014
  • Firstpage
    1119
  • Lastpage
    1130
  • Abstract
    GO relation embodies some aspects of existence dependency. If GO term xis existence-dependent on GO term y, the presence of y implies the presence of x. Therefore, the genes annotated with the function of the GO term y are usually functionally and semantically related to the genes annotated with the function of the GO term x. A large number of gene set enrichment analysis methods have been developed in recent years for analyzing gene sets enrichment. However, most of these methods overlook the structural dependencies between GO terms in GO graph by not considering the concept of existence dependency. We propose in this paper a biological search engine called RSGSearch that identifies enriched sets of genes annotated with different functions using the concept of existence dependency. We observe that GO term xcannot be existence-dependent on GO term y, if x- and y- have the same specificity (biological characteristics). After encoding into a numeric format the contributions of GO terms annotating target genes to the semantics of their lowest common ancestors (LCAs), RSGSearch uses microarray experiment to identify the most significant LCA that annotates the result genes. We evaluated RSGSearch experimentally and compared it with five gene set enrichment systems. Results showed marked improvement.
  • Keywords
    bioinformatics; encoding; genetics; search engines; GO graph; GO relation embodies; RSGSearch; biological characteristics; biological search engine; encoding; gene annotation; gene set enrichment analysis; lowest common ancestors; microarray experiment; numeric format; semantically related significant genes; structural dependencies; Bioinformatics; Biological information theory; Biological system modeling; Muscles; Semantics; Statistics; Semantically related genes; gene ontology; gene set enrichment analysis; related genes; semantic similarity;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2344668
  • Filename
    6868276