DocumentCode :
2349865
Title :
Result identification for biomedical abstracts using Conditional Random Fields
Author :
Lin, Ryan T.K. ; Dai, Hong-Jei ; Bow, Yue-Yang ; Day, Min-Yuh ; Tzong-Han Tsai, Richard ; Hsu, Wen-Lian
Author_Institution :
Institute of Information Science, Academia Sinica, Taipei, China
fYear :
2008
fDate :
13-15 July 2008
Firstpage :
122
Lastpage :
126
Abstract :
For biomedical research, the most important parts of an abstract are the result and conclusion sections. Some journals divide an abstract into several sections so that readers can easily identify those parts, but others do not. We propose a method that can automatically identify the result and conclusion sections of any biomedical abstracts by formulating this identification problem as a sequence labeling task. Three feature sets (Position, Named Entity, and Word Frequency) are employed with Conditional Random Fields (CRFs) as the underlying machine learning model. Experimental results show that the combination of our proposed feature sets can achieve F-measure, precision, and recall scores of 92.50%, 95.32% and 89.85%, respectively.
Keywords :
Abstracts; Computer science; Frequency; Hidden Markov models; Hypertension; Information science; Machine learning; Support vector machine classification; Support vector machines; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration, 2008. IRI 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV, USA
Print_ISBN :
978-1-4244-2659-1
Electronic_ISBN :
978-1-4244-2660-7
Type :
conf
DOI :
10.1109/IRI.2008.4583016
Filename :
4583016
Link To Document :
بازگشت