DocumentCode :
2950668
Title :
Sentence-based classification of free-text breast cancer radiology reports
Author :
Maghsoodi, Aisan ; Sevenster, Merlijn ; Scholtes, Johannes ; Nalbantov, Georgi
fYear :
2012
fDate :
20-22 June 2012
Firstpage :
1
Lastpage :
4
Abstract :
Radiology reports generally consist of narrative text. It has been envisioned that structured medical content can be leveraged to clinical applications. Text-mining techniques can be utilized to realize this vision. We created a pipeline for automatic sentence classification of narrative breast cancer radiology reports. A corpus of 353 reports and 8166 sentences was annotated with seven sentence classes related to laterality, modality and recommendation. Sentences have been represented by four types of feature sets, characterizing various levels of linguistic complexity and domain knowledge. We conducted an evaluation to find the optimal combination of features and the optimal classification paradigm. The classification accuracy ranges between 92 and 98% for the different classes.
Keywords :
cancer; data mining; medical computing; radiology; text analysis; automatic sentence classification; domain knowledge; free-text breast cancer radiology reports; linguistic complexity; narrative breast cancer radiology; sentence-based classification; text-mining techniques; Accuracy; Breast; Feature extraction; Machine learning; Radiology; Sections; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer-Based Medical Systems (CBMS), 2012 25th International Symposium on
Conference_Location :
Rome
ISSN :
1063-7125
Print_ISBN :
978-1-4673-2049-8
Type :
conf
DOI :
10.1109/CBMS.2012.6266374
Filename :
6266374
Link To Document :
بازگشت