Title :
Who´s who? Identifying concepts and entities across multiple documents
Author :
Kazi, Zunaid ; Ravin, Yael
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
A number of research and software development groups have developed technology for identifying terms and names in documents and associating them with concepts and named entries, but few have addressed coreference of concepts and entities across multiple documents in a collection. Cross-document coreference is challenging, since a collection of documents consists of multiple discourse contexts, with a many-to-many correspondence between terms and names on one hand and the concepts and entities they refer to on the other. In this paper we describe extensions to our intra-document term and name identification for coreferencing concepts and entities across documents.
Keywords :
document handling; software engineering; coreference; coreferencing concepts; cross-document coreference; entities; intra-document; multiple documents; software development; Data mining; Identity-based encryption; Information retrieval; Knowledge management; Software libraries; Software packages;
Conference_Titel :
System Sciences, 2000. Proceedings of the 33rd Annual Hawaii International Conference on
Print_ISBN :
0-7695-0493-0
DOI :
10.1109/HICSS.2000.926686