Title :
Automatic extraction of abbreviation definitions based on general texts
Author :
Zhihua Zhou ; Guang Chen
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
The study of abbreviation identifications mostly is limited to the biomedical literature. The wide use of abbreviations in general texts, including web data and newswire data, requires us to process and extract the abbreviation definition. In this paper, we propose an abbreviation definition identification algorithm, which employs a variety of rules and incorporates shallow parsing of the text to identify the most probable abbreviation definition from general texts. The performance of our system was tested with data set provided by 2012 NIST1 TAC-KBP2, obtaining a performance of 94.2% recall and 95.5% precision.
Keywords :
pattern recognition; text analysis; NIST; TAC-KBP; abbreviation definition automatic extraction; abbreviation identifications; general texts; shallow parsing; Artificial neural networks; Computational linguistics; Context; Pattern matching; Reliability; Speech; Terminology; Abbreviations; Definitions; General Texts; Pattern Recognition; Rules;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2013 10th International Conference on
Conference_Location :
Shenyang
DOI :
10.1109/FSKD.2013.6816313