DocumentCode :
173952
Title :
Classification ensemble to improve medical Named Entity Recognition
Author :
Keretna, Sara ; Chee Peng Lim ; Creighton, Douglas ; Shaban, Khaled Bashir
Author_Institution :
Centre for Intell. Syst. Res., Deakin Univ., Geelong, VIC, Australia
fYear :
2014
fDate :
5-8 Oct. 2014
Firstpage :
2630
Lastpage :
2636
Abstract :
An accurate Named Entity Recognition (NER) is important for knowledge discovery in text mining. This paper proposes an ensemble machine learning approach to recognise Named Entities (NEs) from unstructured and informal medical text. Specifically, Conditional Random Field (CRF) and Maximum Entropy (ME) classifiers are applied individually to the test data set from the i2b2 2010 medication challenge. Each classifier is trained using a different set of features. The first set focuses on the contextual features of the data, while the second concentrates on the linguistic features of each word. The results of the two classifiers are then combined. The proposed approach achieves an f-score of 81.8%, showing a considerable improvement over the results from CRF and ME classifiers individually which achieve f-scores of 76% and 66.3% for the same data set, respectively.
Keywords :
data mining; learning (artificial intelligence); maximum entropy methods; medical information systems; pattern classification; random processes; text analysis; CRF; ME classifiers; NER; classification ensemble; conditional random field; contextual features; ensemble machine learning approach; informal medical text; knowledge discovery; linguistic features; maximum entropy classifiers; medical named entity recognition; text mining; unstructured medical text; Context modeling; Entropy; Feature extraction; Information retrieval; Testing; Text recognition; Training; Machine learning; biomedical named entity recognition; conditional random field; information extraction; maximum entropy; medical text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on
Conference_Location :
San Diego, CA
Type :
conf
DOI :
10.1109/SMC.2014.6974324
Filename :
6974324
Link To Document :
بازگشت