• DocumentCode
    652110
  • Title

    Using Candidate Exploration and Ranking for Abbreviation Resolution in Clinical Document

  • Author

    Jong-Beom Kim ; Heung-Seon Oh ; Sang-Soo Nam ; Sung-Hyon Myaeng

  • Author_Institution
    Mobile Commun. Co., LG Electron. Inc., Seoul, South Korea
  • fYear
    2013
  • fDate
    9-11 Sept. 2013
  • Firstpage
    317
  • Lastpage
    326
  • Abstract
    In biomedical texts, abbreviations are frequently used due to their inclusion of many technical expressions of some length. Accordingly, appropriate recognition of abbreviations and their full form pairs is an essential task in automatic text processing of biomedical documents. However, unlike the biomedical literature, clinical notes have many abbreviations without their full forms available in the text or without standard definitions in dictionaries due to the nature of the documents. This causes difficulties in adapting traditional approaches for abbreviation disambiguation such as classification among fixed candidates or pattern-based definition extraction. Because of this reason, we consider the task as search problem and propose an approach with two steps: a) exploring possible full form candidates from various resources and b) choosing most acceptable one among retrieved candidates by ranking. To discover full form candidates and their features, we exploited external academic resources such as MEDLINE and UMLS as well as the clinical note corpus itself. To rank the candidates properly based on human criteria, we adopted Rank Boost, one of the learning-to-rank models developed from information retrieval and machine learning communities. Experimental results show the suggested two-step approach is promising for this kind of task.
  • Keywords
    Unified Modeling Language; dictionaries; document handling; information retrieval; learning (artificial intelligence); medical computing; medical information systems; MEDLINE; Rank Boost; UMLS; abbreviation disambiguation; abbreviation resolution; automatic text processing; biomedical documents; biomedical literature; biomedical texts; candidate exploration; clinical document; clinical note corpus; clinical notes; dictionary; external academic resources; human criteria; information retrieval; learning-to-rank models; machine learning community; pattern-based definition extraction; retrieved candidates; search problem; Communities; Dictionaries; Medical diagnostic imaging; Standards; Text recognition; Unified modeling language; Vocabulary; Abbreviation Resolution; Learning to Rank; Medical Text Processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Healthcare Informatics (ICHI), 2013 IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Type

    conf

  • DOI
    10.1109/ICHI.2013.44
  • Filename
    6680492