DocumentCode :
2073279
Title :
Detection and normalization of medical terms using domain-specific term frequency and adaptive ranking
Author :
Kim, Mi-Young ; Goebel, Randy
Author_Institution :
Dept. of Comput. Sci., Univ. of Alberta, Edmonton, AB, Canada
fYear :
2010
fDate :
3-5 Nov. 2010
Firstpage :
1
Lastpage :
5
Abstract :
As the volume of clinic notes written in natural language is rapidly increasing, physicians need a tool to automatically extract information about diseases/treatments. The main problem in extracting medical information is that physicians use variant words to describe the same disease/treatment. In order to help physicians interpret and share disease/treatment information in clinic notes, we need to reliably and effectively detect and normalize the medical terms. In this study, we perform detection/normalization of medical terms using a UMLS meta-thesaurus combined with a document retrieval technique. We regard a medical sentence as a query, and a UMLS ontology entry as a document, and try to apply a language modeling-based information retrieval method as currently used in the document retrieval field. Because the term frequency in the UMLS dictionary is uniform, we employ a domain-specific term frequency instead of traditional term frequency. To retrieve only the relevant terms in 900,000 UMLS entries, we also propose an adaptive ranking method which dynamically determines the relevant documents for each query without using static cut-off threshold. The experimental results outperform the previous methods in detecting and normalizing medical terms in Medline clinical trials, and our approach can be used in normalizing the real diagnosis list in the patient charts of physicians.
Keywords :
Unified Modeling Language; diseases; medical computing; medical information systems; ontologies (artificial intelligence); query processing; thesauri; Medline clinical trials; UMLS metathesaurus; adaptive ranking; clinic notes; diagnosis list; disease treatment; document retrieval; domain-specific term frequency; information retrieval; language modeling; medical information extraction; medical terms detection; medical terms normalization; ontology; query; static cut-off threshold; Diseases; Drugs; Lungs; Medical diagnostic imaging; Muscles; Unified modeling language;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology and Applications in Biomedicine (ITAB), 2010 10th IEEE International Conference on
Conference_Location :
Corfu
Print_ISBN :
978-1-4244-6559-0
Type :
conf
DOI :
10.1109/ITAB.2010.5687670
Filename :
5687670
Link To Document :
بازگشت