• DocumentCode
    615373
  • Title

    Literature mining associations of diseases using gene ontology

  • Author

    Li Yang ; Yanhong Zhou ; Zhuo Tang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
  • fYear
    2013
  • fDate
    26-28 April 2013
  • Firstpage
    789
  • Lastpage
    794
  • Abstract
    Upon the coming of post-genomics era, the research on gene function becomes especially important. Updating of the curated databases is lack of timely. There is a lot of knowledge hidden in biomedicine literatures. With the ever increasing amount of biomedicine literatures, mining the relations automatically is very urgent. Modern research has found most diseases are genetic. The relations between diseases and gene functions are waiting to be mining. This paper proposes a system to mine relations between diseases with common gene functions in the literature with normalized Metathesaurus. First, a two-phase Conditional Random Fields (CRFs) is used to recognize the disease mentions and gene function mentions, including the location and identification. Then, the paper adapts the Disease Ontology (DO) and the Gene Ontology (GO) to annotate the diseases and gene functions recognized for normalization by computing the similarity between mentions and concepts. According to the similarities, the mentions are denoted as concepts and instances distinctively. Thirdly, the paper builds a network and measure relations between diseases by computing similarities between common sub-graphs. The experiments carried out on a corpus randomly selected by GoPubMed with disease and the three domains in GO. The performance shows a lot of hidden relations between diseases and gives an explanation.
  • Keywords
    data mining; diseases; genetics; medical information systems; ontologies (artificial intelligence); CRF; DO; GO; GoPubMed; biomedicine literatures; conditional random fields; curated databases; disease mentions; disease ontology; gene function; gene function mentions; gene ontology; literature mining associations; metathesaurus; post-genomics era; Biology; Biomedical measurement; Diseases; Disease Ontology; Gene Ontology; disease; gene function; sub-graph; text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science & Education (ICCSE), 2013 8th International Conference on
  • Conference_Location
    Colombo
  • Print_ISBN
    978-1-4673-4464-7
  • Type

    conf

  • DOI
    10.1109/ICCSE.2013.6554016
  • Filename
    6554016