DocumentCode
3300908
Title
Automatic construction of biomedical abbreviations dictionary from text
Author
Quan, Changqin ; Ren, Fuji ; He, Tingting ; Hu, Po
Author_Institution
Dept. of Comput. Sci., Huazhong Normal Univ., Wuhan
fYear
2008
fDate
19-22 Oct. 2008
Firstpage
1
Lastpage
5
Abstract
The number of biomedical abbreviations is large and growing rapidly. Automatically constructing a biomedical abbreviation dictionary from text helps readers understand the biomedical literature and helps keep existing databases, ontologies, and dictionaries up to date. This paper proposes a new method for automatically constructing such a dictionary from text by combining a string matching algorithm with a searching algorithm. The string matching algorithm extracts abbreviations and their long forms. The searching algorithm corrects the false long forms produced by the string matching algorithm; it is based on the idea that readers often look up related articles to judge whether the long form of an abbreviation is correct. Our experiments show that the method achieves high precision (97.5%) and recall (82.2%). Because no tagged corpus is required, the method is also efficient.
Keywords
data mining; dictionaries; text analysis; biomedical abbreviation dictionary; searching algorithm; string matching algorithm; text mining; Automatic speech recognition; Biomedical engineering; Computer science; Data engineering; Databases; Dictionaries; Intelligent systems; Ontologies; Pattern matching; Systems engineering and theory; Text mining; biomedical abbreviations
fLanguage
English
Publisher
IEEE
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-4515-8
Electronic_ISBN
978-1-4244-2780-2
Type
conf
DOI
10.1109/NLPKE.2008.4906784
Filename
4906784