DocumentCode
2138170
Title
HaDextract: Extracting HLA-Disease Interaction Information from Biomedical Literature
Author
Chae, JeongMin ; Park, Kinam ; Jung, YoungHee ; Jung, Soonyoung ; Chae, JiEun ; Oh, HeungBum
Author_Institution
Dept. of Comput. Educ., Korea Univ., Seoul, South Korea
Volume
3
fYear
2008
fDate
13-15 Dec. 2008
Firstpage
90
Lastpage
95
Abstract
The HLA control a variety of function involved in immune response and influence susceptibility to over 40 diseases. It is important to find out how HLA cause the disease or modify susceptibility or course of it. In this paper, we developed an automatic HLA-disease information extraction procedure that uses biomedical publications. First, HLA and diseases are recognized in the literature using built-in regular languages and disease categories of Mesh. Second, we generated parse trees for each sentence in PubMed using collins parser. Third, we build our own information extraction algorithm. The algorithm searched parsing trees and extracted relation information from sentences. The precision rate of extracted relations reported 89.6 in randomly selected 144 sentences.
Keywords
diseases; grammars; medical information systems; molecular biophysics; search engines; HLA control; HLA-disease interaction; HaDextract; Mesh; PubMed; biomedical literature; collins parser; human leukocyte antigen system; immune response; influence susceptibility; information extraction algorithm; Biomedical computing; Computer networks; Computer science education; Data mining; Diseases; Frequency; Humans; Immune system; Peptides; Proteins; HLA; disease; interaction information; text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Future Generation Communication and Networking, 2008. FGCN '08. Second International Conference on
Conference_Location
Hainan Island
Print_ISBN
978-0-7695-3431-2
Type
conf
DOI
10.1109/FGCN.2008.161
Filename
4734286
Link To Document