DocumentCode :
3300908
Title :
Automatic construction of biomedical abbreviations dictionary from text
Author :
Quan, Changqin ; Ren, Fuji ; He, Tingting ; Hu, Po
Author_Institution :
Dept. of Comput. Sci., Huazhong Normal Univ., Wuhan
fYear :
2008
fDate :
19-22 Oct. 2008
Firstpage :
1
Lastpage :
5
Abstract :
The number of biomedical abbreviations is growing rapidly. Automatically constructing a dictionary of biomedical abbreviations from text helps readers understand the biomedical literature and helps update existing databases, ontologies, and dictionaries. This paper proposes a new method for automatically constructing such a dictionary from text by combining a string matching algorithm with a searching algorithm. The string matching algorithm extracts abbreviations and their long forms. The searching algorithm corrects the false long forms produced by the string matching algorithm; it is based on the idea that readers often look up related articles to judge whether the long form of an abbreviation is correct. Our experiments show that the method achieves high precision (97.5%) and recall (82.2%). Because no tagged corpus is required, the method is also efficient.
Keywords :
data mining; dictionaries; text analysis; biomedical abbreviation dictionary; searching algorithm; string matching algorithm; text mining; Automatic speech recognition; Biomedical engineering; Computer science; Data engineering; Databases; Dictionaries; Intelligent systems; Ontologies; Pattern matching; Systems engineering and theory; Text mining; biomedical abbreviations;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4515-8
Electronic_ISBN :
978-1-4244-2780-2
Type :
conf
DOI :
10.1109/NLPKE.2008.4906784
Filename :
4906784