DocumentCode
3380796
Title
A statistical technique for bootstrapping available resources for proper nouns classification
Author
Cucchiarelli, Alessandro ; Velardi, Paola
Author_Institution
Ist. di Inf., Ancona Univ., Italy
fYear
1999
fDate
1999
Firstpage
429
Lastpage
435
Abstract
Describes an algorithm for improving the performance of unknown proper noun recognizers, using a statistical framework. We present a bootstrapping technique that starts out by using a training set to acquire contextual classification cues, and then uses the results of the initial phase to acquire additional training data from an unlabeled corpus. The training set (tagged proper nouns in contexts) is obtained trough an application of standard knowledge-based techniques for proper noun tagging, commonly used in information extraction systems
Keywords
context-sensitive grammars; learning (artificial intelligence); natural languages; pattern classification; bootstrapping; contextual classification cues; information extraction systems; proper noun tagging; proper nouns classification; standard knowledge-based techniques; statistical technique; training set; unlabeled corpus; Data mining; Dictionaries; Electronic mail; Information systems; Remuneration; Tagging; Telephony; Text recognition; Thesauri; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Intelligence and Systems, 1999. Proceedings. 1999 International Conference on
Conference_Location
Bethesda, MD
Print_ISBN
0-7695-0446-9
Type
conf
DOI
10.1109/ICIIS.1999.810312
Filename
810312
Link To Document