DocumentCode
615373
Title
Literature mining associations of diseases using gene ontology
Author
Li Yang ; Yanhong Zhou ; Zhuo Tang
Author_Institution
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
fYear
2013
fDate
26-28 April 2013
Firstpage
789
Lastpage
794
Abstract
Upon the coming of post-genomics era, the research on gene function becomes especially important. Updating of the curated databases is lack of timely. There is a lot of knowledge hidden in biomedicine literatures. With the ever increasing amount of biomedicine literatures, mining the relations automatically is very urgent. Modern research has found most diseases are genetic. The relations between diseases and gene functions are waiting to be mining. This paper proposes a system to mine relations between diseases with common gene functions in the literature with normalized Metathesaurus. First, a two-phase Conditional Random Fields (CRFs) is used to recognize the disease mentions and gene function mentions, including the location and identification. Then, the paper adapts the Disease Ontology (DO) and the Gene Ontology (GO) to annotate the diseases and gene functions recognized for normalization by computing the similarity between mentions and concepts. According to the similarities, the mentions are denoted as concepts and instances distinctively. Thirdly, the paper builds a network and measure relations between diseases by computing similarities between common sub-graphs. The experiments carried out on a corpus randomly selected by GoPubMed with disease and the three domains in GO. The performance shows a lot of hidden relations between diseases and gives an explanation.
Keywords
data mining; diseases; genetics; medical information systems; ontologies (artificial intelligence); CRF; DO; GO; GoPubMed; biomedicine literatures; conditional random fields; curated databases; disease mentions; disease ontology; gene function; gene function mentions; gene ontology; literature mining associations; metathesaurus; post-genomics era; Biology; Biomedical measurement; Diseases; Disease Ontology; Gene Ontology; disease; gene function; sub-graph; text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science & Education (ICCSE), 2013 8th International Conference on
Conference_Location
Colombo
Print_ISBN
978-1-4673-4464-7
Type
conf
DOI
10.1109/ICCSE.2013.6554016
Filename
6554016
Link To Document