Title :
A hybrid approach to recognising discourse causality in the biomedical domain
Author :
Claudiu Mihăilă;Sophia Ananiadou
Author_Institution :
Nat. Centre for Text Min., Univ. of Manchester, Manchester, UK
Abstract :
Whilst current domain-specific information extraction systems represent an important resource for biomedical researchers, the increasing amount of knowledge published daily still overwhelms them. Automatic discourse causality recognition can therefore further improve the search for relevant information by suggesting possible causal connections. We describe here an approach to the automatic recognition of discourse causality in the biomedical domain using a combination of machine learning and rules. We test and evaluate our system on BioCause, a corpus containing gold standard annotations of causal relations. The best performance in identifying triggers is achieved by CRFs, with an F-score of 79.35%. We then locate the arguments using naïve syntactic rules, achieving F-scores of around 90% in most cases. Finally, a group of machine learners determines which argument plays which role, with an F-score of 84.35%.
Keywords :
"Semantics","Syntactics","Support vector machines","Biological system modeling","Unified modeling language","Pipelines","Feature extraction"
Conference_Title :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
DOI :
10.1109/BIBM.2013.6732519