Title :
Automatic construction of biomedical abbreviations dictionary from text
Author :
Quan, Changqin ; Ren, Fuji ; He, Tingting ; Hu, Po
Author_Institution :
Dept. of Comput. Sci., Huazhong Normal Univ., Wuhan
Abstract :
The size and growth rate of biomedical abbreviation are increasing very fast, automatic construction of biomedical abbreviations dictionary from text helps to understand biomedical literature, and to update existing databases, ontologies, and dictionaries. This paper proposes a new method for automatic construction of biomedical abbreviations dictionary from text by combining string matching algorithm and searching algorithm. The string matching algorithm extracts abbreviations and their longforms. The searching algorithm corrects the false longforms produced by the string matching algorithm. The searching algorithm is based on the idea that readers often lookup relative articles to judge the longform of an abbreviation is correct or not. Our experiments show that the algorithm has high precision (97.5%) and recall (82.2%). And because tagged corpus is not necessary, the method has high efficiency.
Keywords :
data mining; dictionaries; text analysis; biomedical abbreviation dictionary; searching algorithm; string matching algorithm; text mining; Automatic speech recognition; Biomedical engineering; Computer science; Data engineering; Databases; Dictionaries; Intelligent systems; Ontologies; Pattern matching; Systems engineering and theory; Text mining; biomedical abbreviations;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4515-8
Electronic_ISBN :
978-1-4244-2780-2
DOI :
10.1109/NLPKE.2008.4906784