DocumentCode :
2369403
Title :
Automated classification of radiology reports for acute lung injury: Comparison of keyword and machine learning based natural language processing approaches
Author :
Solti, Imre ; Cooke, Colin R. ; Xia, Fei ; Wurfel, Mark M.
Author_Institution :
Dept. of Med. Educ. & Biomed. Inf., Univ. of Washington, Seattle, WA, USA
fYear :
2009
fDate :
1-4 Nov. 2009
Firstpage :
314
Lastpage :
319
Abstract :
This paper compares the performance of keyword and machine learning-based chest x-ray report classification for Acute Lung Injury (ALI). ALI mortality is approximately 30 percent. High mortality is, in part, a consequence of delayed manual chest x-ray classification. An automated system could reduce the time to recognize ALI and lead to reductions in mortality. For our study, 96 and 857 chest x-ray reports in two corpora were labeled by domain experts for ALI. We developed a keyword and a Maximum Entropy-based classification system. Word unigram and character n-grams provided the features for the machine learning system. The Maximum Entropy algorithm with character 6-gram achieved the highest performance (Recall=0.91, Precision=0.90 and F-measure=0.91) on the 857-report corpus. This study has shown that for the classification of ALI chest x-ray reports, the machine learning approach is superior to the keyword based system and achieves comparable results to highest performing physician annotators.
Keywords :
diseases; learning (artificial intelligence); medical diagnostic computing; natural language processing; pattern classification; radiology; acute lung injury; character n-grams; chest X-ray report classification; machine learning system; maximum entropy algorithm; maximum entropy-based classification system; natural language processing; radiology reports automated classification; word unigram; Biomedical informatics; Delay; Gold; Injuries; Learning systems; Lungs; Machine learning; Medical diagnostic imaging; Natural language processing; Radiology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-5121-0
Type :
conf
DOI :
10.1109/BIBMW.2009.5332081
Filename :
5332081
Link To Document :
بازگشت