DocumentCode :
652110
Title :
Using Candidate Exploration and Ranking for Abbreviation Resolution in Clinical Document
Author :
Jong-Beom Kim ; Heung-Seon Oh ; Sang-Soo Nam ; Sung-Hyon Myaeng
Author_Institution :
Mobile Commun. Co., LG Electron. Inc., Seoul, South Korea
fYear :
2013
fDate :
9-11 Sept. 2013
Firstpage :
317
Lastpage :
326
Abstract :
In biomedical texts, abbreviations are frequently used due to their inclusion of many technical expressions of some length. Accordingly, appropriate recognition of abbreviations and their full form pairs is an essential task in automatic text processing of biomedical documents. However, unlike the biomedical literature, clinical notes have many abbreviations without their full forms available in the text or without standard definitions in dictionaries due to the nature of the documents. This causes difficulties in adapting traditional approaches for abbreviation disambiguation such as classification among fixed candidates or pattern-based definition extraction. Because of this reason, we consider the task as search problem and propose an approach with two steps: a) exploring possible full form candidates from various resources and b) choosing most acceptable one among retrieved candidates by ranking. To discover full form candidates and their features, we exploited external academic resources such as MEDLINE and UMLS as well as the clinical note corpus itself. To rank the candidates properly based on human criteria, we adopted Rank Boost, one of the learning-to-rank models developed from information retrieval and machine learning communities. Experimental results show the suggested two-step approach is promising for this kind of task.
Keywords :
Unified Modeling Language; dictionaries; document handling; information retrieval; learning (artificial intelligence); medical computing; medical information systems; MEDLINE; Rank Boost; UMLS; abbreviation disambiguation; abbreviation resolution; automatic text processing; biomedical documents; biomedical literature; biomedical texts; candidate exploration; clinical document; clinical note corpus; clinical notes; dictionary; external academic resources; human criteria; information retrieval; learning-to-rank models; machine learning community; pattern-based definition extraction; retrieved candidates; search problem; Communities; Dictionaries; Medical diagnostic imaging; Standards; Text recognition; Unified modeling language; Vocabulary; Abbreviation Resolution; Learning to Rank; Medical Text Processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Healthcare Informatics (ICHI), 2013 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Type :
conf
DOI :
10.1109/ICHI.2013.44
Filename :
6680492
Link To Document :
بازگشت