DocumentCode :
2691040
Title :
iSimp: A sentence simplification system for biomedicail text
Author :
Yifan Peng ; Tudor, C.O. ; Torii, Manabu ; Wu, Cathy H. ; Vijay-Shanker, K.
Author_Institution :
Comput. & Inf. Sci., Univ. of Delaware, Newark, DE, USA
fYear :
2012
fDate :
4-7 Oct. 2012
Firstpage :
1
Lastpage :
6
Abstract :
Text mining applications using natural language processing are often confronted with long and complicated sentences. This is observed particularly in the abstracts of scientific articles where authors summarize, in few sentences, the various facts described throughout the manuscript. Being rich in novel and important information, the abstract has been the primary target of biomedicai text mining applications. In this work, we aim to simplify complex sentences in abstracts of biomedicai text so that they can be readily processed by text mining applications. We focus on syntactic constructs that are frequently encountered in the biomedicai literature, such as coordinations, relative clauses, and appositions, with emphasis on their boundary detection. Our approach yielded good detection performance (average F-measure between 86.5% and 92.7%), and aided in improving biomedicai text mining applications, RLIMS-P and RankPref.
Keywords :
data acquisition; data mining; medical computing; natural language processing; text analysis; word processing; RLIMS-P; RankPref; abstracts; appositions; biomedicai text mining applications; boundary detection; coordinations; detection performance; iSimp; natural language processing; relative clauses; scientific articles; sentence simplification system; Abstracts; Information retrieval; Natural language processing; Proteins; Substrates; Syntactics; Text mining; information extraction; natural language processing; sentence simplification; text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2559-2
Electronic_ISBN :
978-1-4673-2558-5
Type :
conf
DOI :
10.1109/BIBM.2012.6392671
Filename :
6392671
Link To Document :
بازگشت