DocumentCode :
3466821
Title :
Unrestricted Coreference: Identifying Entities and Events in OntoNotes
Author :
Pradhan, Sameer S. ; Ramshaw, Lance ; Weischedel, Ralph ; MacBride, Jessica ; Micciulla, Linnea
Author_Institution :
BBN Technol., Cambridge
fYear :
2007
fDate :
17-19 Sept. 2007
Firstpage :
446
Lastpage :
453
Abstract :
Most research in the field of anaphora or coreference detection has been limited to noun phrase coreference, usually on a restricted set of entities, such as ACE entities. In part, this has been due to the lack of corpus resources tagged with general anaphoric coreference. The OntoNotes project is creating a large-scale, accurate corpus for general anaphoric coreference that covers entities and events not limited to noun phrases or a limited set of entity types. The coreference layer in OntoNotes constitutes one part of a multi-layer, integrated annotation of shallow semantic structure in text. This paper presents an initial model for unrestricted coreference based on this data that uses a machine learning architecture with state-of-the-art features. Significant improvements can be expected from using such cross-layer information for training predictive models. This paper describes the coreference annotation in OntoNotes, presents the baseline model, and provides an analysis of the contribution of this new resource in the context of recent MUC and ACE results.
Keywords :
learning (artificial intelligence); software architecture; text analysis; OntoNotes; automatic content extraction; coreference anaphora; cross-layer information; machine learning architecture; predictive model training; state-of-the-art features; text shallow semantic structure; Context modeling; Event detection; Large-scale systems; Machine learning; Natural languages; Ontologies; Performance analysis; Predictive models; Tagging; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing, 2007. ICSC 2007. International Conference on
Conference_Location :
Irvine, CA
Print_ISBN :
978-0-7695-2997-4
Type :
conf
DOI :
10.1109/ICSC.2007.93
Filename :
4338380
Link To Document :
بازگشت