• DocumentCode
    3300908
  • Title

    Automatic construction of biomedical abbreviations dictionary from text

  • Author

    Quan, Changqin ; Ren, Fuji ; He, Tingting ; Hu, Po

  • Author_Institution
    Dept. of Comput. Sci., Huazhong Normal Univ., Wuhan
  • fYear
    2008
  • fDate
    19-22 Oct. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The size and growth rate of biomedical abbreviation are increasing very fast, automatic construction of biomedical abbreviations dictionary from text helps to understand biomedical literature, and to update existing databases, ontologies, and dictionaries. This paper proposes a new method for automatic construction of biomedical abbreviations dictionary from text by combining string matching algorithm and searching algorithm. The string matching algorithm extracts abbreviations and their longforms. The searching algorithm corrects the false longforms produced by the string matching algorithm. The searching algorithm is based on the idea that readers often lookup relative articles to judge the longform of an abbreviation is correct or not. Our experiments show that the algorithm has high precision (97.5%) and recall (82.2%). And because tagged corpus is not necessary, the method has high efficiency.
  • Keywords
    data mining; dictionaries; text analysis; biomedical abbreviation dictionary; searching algorithm; string matching algorithm; text mining; Automatic speech recognition; Biomedical engineering; Computer science; Data engineering; Databases; Dictionaries; Intelligent systems; Ontologies; Pattern matching; Systems engineering and theory; Text mining; biomedical abbreviations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4515-8
  • Electronic_ISBN
    978-1-4244-2780-2
  • Type

    conf

  • DOI
    10.1109/NLPKE.2008.4906784
  • Filename
    4906784