DocumentCode
973877
Title
Two-Phase Chief Complaint Mapping to the UMLS Metathesaurus in Korean Electronic Medical Records
Author
Kang, Bo-Yeong ; Kim, Dae-Won ; Kim, Hong-Gee
Author_Institution
Dentistry Coll., Seoul Nat. Univ., Seoul
Volume
13
Issue
1
fYear
2009
Firstpage
78
Lastpage
86
Abstract
The task of automatically determining the concepts referred to in chief complaint (CC) data from electronic medical records (EMRs) is an essential component of many EMR applications aimed at biosurveillance for disease outbreaks. Previous approaches that have been used for this concept mapping have mainly relied on term-level matching, whereby the medical terms in the raw text and their synonyms are matched with concepts in a terminology database. These previous approaches, however, have shortcomings that limit their efficacy in CC concept mapping, where the concepts for CC data are often represented by associative terms rather than by synonyms. Therefore, herein we propose a concept mapping scheme based on a two-phase matching approach, especially for application to Korean CCs, which uses term-level complete matching in the first phase and concept-level matching based on concept learning in the second phase. The proposed concept-level matching suggests the method to learn all the terms (associative terms as well as synonyms) that represent the concept and predict the most probable concept for a CC based on the learned terms. Experiments on 1204 CCs extracted from 15 618 discharge summaries of Korean EMRs showed that the proposed method gave significantly improved F-measure values compared to the baseline system, with improvements of up to 73.57%.
Keywords
database management systems; diseases; medical computing; medical information systems; CC concept mapping; Korean electronic medical records; UMLS Metathesaurus; biosurveillance; disease outbreaks; term-level matching; terminology database; two-phase chief complaint mapping; Chief compliant (CC); Unified Medical Language System (UMLS); concept indexing; concept mapping; information retrieval; machine learning; Abstracting and Indexing as Topic; Algorithms; Artificial Intelligence; Bayes Theorem; Humans; Korea; Medical Informatics; Medical Records Systems, Computerized; Natural Language Processing; Terminology as Topic; Unified Medical Language System;
fLanguage
English
Journal_Title
Information Technology in Biomedicine, IEEE Transactions on
Publisher
ieee
ISSN
1089-7771
Type
jour
DOI
10.1109/TITB.2008.2007103
Filename
4663851
Link To Document