Title of article :
Word Sense Disambiguation by Selecting the Best
Semantic Type Based on Journal Descriptor Indexing:
Preliminary Experiment
Author/Authors :
Susanne M. Humphrey، نويسنده , , Willie J. Rogers، نويسنده , , Halil Kilicoglu، نويسنده , , Dina Demner-Fushman، نويسنده , , Thomas C. Rindflesch، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2006
Abstract :
An experiment was performed at the National Library of
Medicine® (NLM®) in word sense disambiguation (WSD)
using the Journal Descriptor Indexing (JDI) methodology.
The motivation is the need to solve the ambiguity problem
confronting NLM’s MetaMap system, which maps
free text to terms corresponding to concepts in NLM’s
Unified Medical Language System® (UMLS®) Metathesaurus
®. If the text maps to more than one Metathesaurus
concept at the same high confidence score, MetaMap has
no way of knowing which concept is the correct mapping.
We describe the JDI methodology, which is ultimately
based on statistical associations between words in a
training set of MEDLINE® citations and a small set of
journal descriptors (assigned by humans to journals per
se) assumed to be inherited by the citations. JDI is the
basis for selecting the best meaning that is correlated to
UMLS semantic types (STs) assigned to ambiguous concepts
in the Metathesaurus. For example, the ambiguity
transport has two meanings: “Biological Transport” assigned
the ST Cell Function and “Patient transport”
assigned the ST Health Care Activity. A JDI-based
methodology can analyze text containing transport and
determine which ST receives a higher score for that text,
which then returns the associated meaning, presumed to
apply to the ambiguity itself. We then present an experiment
in which a baseline disambiguation method was
compared to four versions of JDI in disambiguating 45
ambiguous strings from NLM’s WSD Test Collection.
Overall average precision for the highest-scoring JDI version
was 0.7873 compared to 0.2492 for the baseline
method, and average precision for individual ambiguities
was greater than 0.90 for 23 of them (51%), greater than
0.85 for 24 (53%), and greater than 0.65 for 35 (79%). On
the basis of these results, we hope to improve performance
of JDI and test its use in applications.
Journal title :
Journal of the American Society for Information Science and Technology
Journal title :
Journal of the American Society for Information Science and Technology