DocumentCode
1863577
Title
Cross-Document Coreference Resolution Based on Automatic Text Summary
Author
Gao, Sanyuan ; Li, Si ; Xu, Weiran ; Guo, Jun
Author_Institution
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2010
fDate
9-10 Jan. 2010
Firstpage
306
Lastpage
309
Abstract
Cross-document coreference resolution plays an import part in the filed of natural language processing (NLP). It captures the ability of gathering documents for information about a certain entity. Most previous algorithms identify the underlying entity of a given document depending on the original text, which is unreliable if the original text contains multiple parts of different themes. In this paper, we propose a cross-document coreference resolution algorithm based on automatic text summary instead of the original text. In our approach, we extract query-specific and informative-indicative summary from the original text by using Hobbs algorithm and measure the similarity between two summaries. This automatic text summary-based cross-document coreference resolution (ATSCDCR) system is effective in disambiguating different entities of the same mention name and identifying the same entity of different mention names. The results from our experiments show that the macro average of ATSCDCR system is up to 73.16% and the micro average of ATSCDCR system is 67.34 %.
Keywords
natural language processing; text analysis; Hobbs algorithm; automatic text summary; cross-document coreference resolution; informative-indicative summary; natural language processing; query-specific summary; similarity measure; Biomedical informatics; Data mining; Displays; Electronic mail; Intelligent systems; Joining processes; Natural language processing; Pattern recognition; Testing; Text recognition; Automatic Text Summary; Cross-Document Coreference Resolution; Hobbs Algorithm; Named Entity Type Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Knowledge Discovery and Data Mining, 2010. WKDD '10. Third International Conference on
Conference_Location
Phuket
Print_ISBN
978-1-4244-5397-9
Electronic_ISBN
978-1-4244-5398-6
Type
conf
DOI
10.1109/WKDD.2010.56
Filename
5432617
Link To Document