Title of article :
Cue-based assertion classification for Swedish clinical text—Developing a lexicon for pyConTextSwe
Author/Authors :
Velupillai، نويسنده , , Sumithra and Skeppstedt، نويسنده , , Maria and Kvist، نويسنده , , Maria and Mowery، نويسنده , , Danielle and Chapman، نويسنده , , Brian E. and Dalianis، نويسنده , , Hercules and Chapman، نويسنده , , Wendy W.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2014
Pages :
8
From page :
137
To page :
144
Abstract :
AbstractObjective ility of a cue-based system to accurately assert whether a disorder is affirmed, negated, or uncertain is dependent, in part, on its cue lexicon. In this paper, we continue our study of porting an assertion system (pyConTextNLP) from English to Swedish (pyConTextSwe) by creating an optimized assertion lexicon for clinical Swedish. s and material egrated cues from four external lexicons, along with generated inflections and combinations. We used subsets of a clinical corpus in Swedish. We applied four assertion classes (definite existence, probable existence, probable negated existence and definite negated existence) and two binary classes (existence yes/no and uncertainty yes/no) to pyConTextSwe. We compared pyConTextSweʹs performance with and without the added cues on a development set, and improved the lexicon further after an error analysis. On a separate evaluation set, we calculated the systemʹs final performance. s ing integration steps, we added 454 cues to pyConTextSwe. The optimized lexicon developed after an error analysis resulted in statistically significant improvements on the development set (83% F-score, overall). The systemʹs final F-scores on an evaluation set were 81% (overall). For the individual assertion classes, F-score results were 88% (definite existence), 81% (probable existence), 55% (probable negated existence), and 63% (definite negated existence). For the binary classifications existence yes/no and uncertainty yes/no, final system performance was 97%/87% and 78%/86% F-score, respectively. sions e successfully ported pyConTextNLP to Swedish (pyConTextSwe). We have created an extensive and useful assertion lexicon for Swedish clinical text, which could form a valuable resource for similar studies, and which is publicly available.
Keywords :
Clinical text mining , Medical language processing , Electronic Health Records , Information extraction , Assertion classification , dictionaries
Journal title :
Artificial Intelligence In Medicine
Serial Year :
2014
Journal title :
Artificial Intelligence In Medicine
Record number :
1841738
Link To Document :
بازگشت