Title :
Abbreviation disambiguation using semantic abstraction of symbols and numeric terms
Author :
Kwang Song, Sa ; Jin, Yun ; Myaeng, Sung Hyon
Author_Institution :
Inf. & Commun. Univ., Daejeon, South Korea
fDate :
30 Oct.-1 Nov. 2005
Abstract :
We propose an abbreviation disambiguation approach that utilizes semantic representation of symbols and numeric terms as well as the words in clinical documents. While majority of related works treats symbols and numeric words as stopword, we show that they play an important role especially in coarse-grained documents such as CDA (clinical document architecture) documents, which contain a lot of jargons, symbols, and abbreviations written by doctors. For abbreviation disambiguation task using a classifier, we compared several variations of our approach with a traditional bag-of-words method. The results show that the system using semantic abstraction of symbols and numeric terms can improve the accuracy from 87.9% to 92.6% when a SVM classifier is used.
Keywords :
document handling; knowledge representation; support vector machines; symbol manipulation; SVM classifier; abbreviation disambiguation; clinical document architecture; coarse-grained document; semantic abstraction; semantic representation; support vector machine; Biomedical imaging; Computer science; Data mining; Information retrieval; Machine learning algorithms; Medical diagnostic imaging; Support vector machine classification; Support vector machines; Text categorization; Training data;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598699