DocumentCode
2113199
Title
Automatic extraction of abbreviation definitions based on general texts
Author
Zhihua Zhou ; Guang Chen
Author_Institution
Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2013
fDate
23-25 July 2013
Firstpage
853
Lastpage
857
Abstract
The study of abbreviation identifications mostly is limited to the biomedical literature. The wide use of abbreviations in general texts, including web data and newswire data, requires us to process and extract the abbreviation definition. In this paper, we propose an abbreviation definition identification algorithm, which employs a variety of rules and incorporates shallow parsing of the text to identify the most probable abbreviation definition from general texts. The performance of our system was tested with data set provided by 2012 NIST1 TAC-KBP2, obtaining a performance of 94.2% recall and 95.5% precision.
Keywords
pattern recognition; text analysis; NIST; TAC-KBP; abbreviation definition automatic extraction; abbreviation identifications; general texts; shallow parsing; Artificial neural networks; Computational linguistics; Context; Pattern matching; Reliability; Speech; Terminology; Abbreviations; Definitions; General Texts; Pattern Recognition; Rules;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2013 10th International Conference on
Conference_Location
Shenyang
Type
conf
DOI
10.1109/FSKD.2013.6816313
Filename
6816313
Link To Document