Title :
Identifying Anatomical Phrases in Clinical Reports by Shallow Semantic Parsing Methods
Author :
Bashyam, Vijayaraghavan ; Taira, Ricky K.
Author_Institution :
Med. Imaging Informatics Group, California Univ., Los Angeles, CA
fDate :
March 1 2007-April 5 2007
Abstract :
Natural language processing (NLP) is being applied for several information extraction tasks in the biomedical domain. The unique nature of clinical information requires the need for developing an NLP system designed specifically for the clinical domain. We describe a method to identify semantically coherent phrases within clinical reports. This is an important step towards full syntactic parsing within a clinical NLP system. We use this semantic phrase chunker to identify anatomical phrases within radiology reports related to the genitourinary domain. A discriminative classifier based on support vector machines was used to classify words into one of five phrase classification categories. Training of the classifier was performed using 1000 hand-tagged sentences from a corpus of genitourinary radiology reports. Features used by the classifier include n-grams, syntactic tags and semantic labels. Evaluation was conducted on a blind test set of 250 sentences from the same domain. The system achieved overall performance scores of 0.87 (precision), 0.91 (recall) and 0.89 (balanced f-score). Anatomical phrase extraction can be rapidly and accurately accomplished
Keywords :
classification; grammars; medical computing; natural language processing; radiology; support vector machines; anatomical phrase extraction; classification categories; clinical information extraction; clinical reports; discriminative classifier; genitourinary radiology reports; n-grams; natural language processing; semantic labels; semantic phrase chunker; shallow semantic parsing; support vector machines; syntactic parsing; syntactic tags; Anatomy; Biomedical imaging; Biomedical informatics; Computational intelligence; Data mining; Medical diagnostic imaging; Natural language processing; Radiology; Support vector machines; Urogenital system; Natural language processing; anatomy phrases; radiology reports; shallow semantic parsing; support vector machines;
Conference_Titel :
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0705-2
DOI :
10.1109/CIDM.2007.368874