DocumentCode
3240961
Title
Coreference resolution in biomedical full-text articles with domain dependent features
Author
Huang, Cuili ; Wang, Yaqiang ; Zhang, Yongmei ; Jin, Yu ; Yu, Zhonghua
Author_Institution
Coll. of Comput. Sci., Sichuan Univ., Chengdu, China
fYear
2010
fDate
2-4 Nov. 2010
Firstpage
616
Lastpage
620
Abstract
Coreference resolution is one of the most significant and difficult tasks in text mining. This paper proposes an algorithm to find coreference relations in biomedical papers. We concentrate on noun phrases referring to biomedical entities and do our research on biomedical full-text articles instead of paper abstract. The algorithm not only uses features about linguistic but also introduces features derived from biomedical domain knowledge to deeply reveal the coreference relation between biomedical noun phrases. These features are employed to a probabilistic-based classifier which takes the dependence of features into account. Experiments show that the features are helpful and the algorithm is effective for coreference resolution in the biomedical domain. On a full-text corpus with anaphoric links, a satisfactory experimental result (76.2% precision, 66.0% recall and 70.7% F-Measure) is achieved by the proposed algorithm. Due to appropriate features, there is 13.8% improvement yielded against the former algorithm.
Keywords
data mining; medical administrative data processing; pattern classification; probability; text analysis; biomedical domain knowledge; biomedical full-text article; biomedical noun phrase; coreference resolution; domain dependent feature; probabilistic based classifier; text mining; Biology; Biomedical measurements; Equations; Semantics; coreference resolution; domain dependent features; full-text articles; gene name;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Technology and Development (ICCTD), 2010 2nd International Conference on
Conference_Location
Cairo
Print_ISBN
978-1-4244-8844-5
Electronic_ISBN
978-1-4244-8845-2
Type
conf
DOI
10.1109/ICCTD.2010.5645973
Filename
5645973
Link To Document