• DocumentCode
    973877
  • Title

    Two-Phase Chief Complaint Mapping to the UMLS Metathesaurus in Korean Electronic Medical Records

  • Author

    Kang, Bo-Yeong ; Kim, Dae-Won ; Kim, Hong-Gee

  • Author_Institution
    Dentistry Coll., Seoul Nat. Univ., Seoul
  • Volume
    13
  • Issue
    1
  • fYear
    2009
  • Firstpage
    78
  • Lastpage
    86
  • Abstract
    The task of automatically determining the concepts referred to in chief complaint (CC) data from electronic medical records (EMRs) is an essential component of many EMR applications aimed at biosurveillance for disease outbreaks. Previous approaches that have been used for this concept mapping have mainly relied on term-level matching, whereby the medical terms in the raw text and their synonyms are matched with concepts in a terminology database. These previous approaches, however, have shortcomings that limit their efficacy in CC concept mapping, where the concepts for CC data are often represented by associative terms rather than by synonyms. Therefore, herein we propose a concept mapping scheme based on a two-phase matching approach, especially for application to Korean CCs, which uses term-level complete matching in the first phase and concept-level matching based on concept learning in the second phase. The proposed concept-level matching suggests the method to learn all the terms (associative terms as well as synonyms) that represent the concept and predict the most probable concept for a CC based on the learned terms. Experiments on 1204 CCs extracted from 15 618 discharge summaries of Korean EMRs showed that the proposed method gave significantly improved F-measure values compared to the baseline system, with improvements of up to 73.57%.
  • Keywords
    database management systems; diseases; medical computing; medical information systems; CC concept mapping; Korean electronic medical records; UMLS Metathesaurus; biosurveillance; disease outbreaks; term-level matching; terminology database; two-phase chief complaint mapping; Chief compliant (CC); Unified Medical Language System (UMLS); concept indexing; concept mapping; information retrieval; machine learning; Abstracting and Indexing as Topic; Algorithms; Artificial Intelligence; Bayes Theorem; Humans; Korea; Medical Informatics; Medical Records Systems, Computerized; Natural Language Processing; Terminology as Topic; Unified Medical Language System;
  • fLanguage
    English
  • Journal_Title
    Information Technology in Biomedicine, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1089-7771
  • Type

    jour

  • DOI
    10.1109/TITB.2008.2007103
  • Filename
    4663851