Title :
Two-Phase Chief Complaint Mapping to the UMLS Metathesaurus in Korean Electronic Medical Records
Author :
Kang, Bo-Yeong ; Kim, Dae-Won ; Kim, Hong-Gee
Author_Institution :
Dentistry Coll., Seoul Nat. Univ., Seoul
Abstract :
The task of automatically determining the concepts referred to in chief complaint (CC) data from electronic medical records (EMRs) is an essential component of many EMR applications aimed at biosurveillance for disease outbreaks. Previous approaches that have been used for this concept mapping have mainly relied on term-level matching, whereby the medical terms in the raw text and their synonyms are matched with concepts in a terminology database. These previous approaches, however, have shortcomings that limit their efficacy in CC concept mapping, where the concepts for CC data are often represented by associative terms rather than by synonyms. Therefore, herein we propose a concept mapping scheme based on a two-phase matching approach, especially for application to Korean CCs, which uses term-level complete matching in the first phase and concept-level matching based on concept learning in the second phase. The proposed concept-level matching suggests the method to learn all the terms (associative terms as well as synonyms) that represent the concept and predict the most probable concept for a CC based on the learned terms. Experiments on 1204 CCs extracted from 15 618 discharge summaries of Korean EMRs showed that the proposed method gave significantly improved F-measure values compared to the baseline system, with improvements of up to 73.57%.
Keywords :
database management systems; diseases; medical computing; medical information systems; CC concept mapping; Korean electronic medical records; UMLS Metathesaurus; biosurveillance; disease outbreaks; term-level matching; terminology database; two-phase chief complaint mapping; Chief compliant (CC); Unified Medical Language System (UMLS); concept indexing; concept mapping; information retrieval; machine learning; Abstracting and Indexing as Topic; Algorithms; Artificial Intelligence; Bayes Theorem; Humans; Korea; Medical Informatics; Medical Records Systems, Computerized; Natural Language Processing; Terminology as Topic; Unified Medical Language System;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2008.2007103