• DocumentCode
    2113199
  • Title

    Automatic extraction of abbreviation definitions based on general texts

  • Author

    Zhihua Zhou ; Guang Chen

  • Author_Institution
    Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2013
  • fDate
    23-25 July 2013
  • Firstpage
    853
  • Lastpage
    857
  • Abstract
    The study of abbreviation identifications mostly is limited to the biomedical literature. The wide use of abbreviations in general texts, including web data and newswire data, requires us to process and extract the abbreviation definition. In this paper, we propose an abbreviation definition identification algorithm, which employs a variety of rules and incorporates shallow parsing of the text to identify the most probable abbreviation definition from general texts. The performance of our system was tested with data set provided by 2012 NIST1 TAC-KBP2, obtaining a performance of 94.2% recall and 95.5% precision.
  • Keywords
    pattern recognition; text analysis; NIST; TAC-KBP; abbreviation definition automatic extraction; abbreviation identifications; general texts; shallow parsing; Artificial neural networks; Computational linguistics; Context; Pattern matching; Reliability; Speech; Terminology; Abbreviations; Definitions; General Texts; Pattern Recognition; Rules;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2013 10th International Conference on
  • Conference_Location
    Shenyang
  • Type

    conf

  • DOI
    10.1109/FSKD.2013.6816313
  • Filename
    6816313