DocumentCode :
1990596
Title :
Efficient Methods for Biomedical Named Entity Recognition
Author :
Chan, Shing-Kit ; Lam, Wai
Author_Institution :
Chinese Univ. of Hong Kong, Shatin
fYear :
2007
fDate :
14-17 Oct. 2007
Firstpage :
729
Lastpage :
735
Abstract :
In recent years, conditional random fields (CRFs) have shown good performance in named entity recognition tasks. However, a direct application of it to biomedical named entity recognition incurs a very high training cost. In this paper, we evaluate two alternatives to training a CRF with a traditional single-phase maximum likelihood training method. One is to use an online training method and the other is to divide the named entity recognition task into two tasks. For the cascaded method, we propose to include a "margin" in the model that leads to better recognition results. Both methods give better performance with substantial decrease in training time. In particular, the cascaded method outperforms the best system in the JNLPBA shared task.
Keywords :
cascade systems; information retrieval systems; medical information systems; biomedical named entity recognition; cascaded method; conditional random fields; entity recognition task; online training method; single-phase maximum likelihood training method; Costs; Databases; Hidden Markov models; Iterative algorithms; Labeling; Maximum likelihood estimation; Optimization methods; Parameter estimation; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-1509-0
Type :
conf
DOI :
10.1109/BIBE.2007.4375641
Filename :
4375641
Link To Document :
بازگشت