DocumentCode :
2968603
Title :
A Verb-Centric Approach for Relationship Extraction in Biomedical Text
Author :
Sharma, Abhishek ; Swaminathan, Rajesh ; Yang, Hui
Author_Institution :
Dept. of Comput. Sci., San Francisco State Univ., San Francisco, CA, USA
fYear :
2010
fDate :
22-24 Sept. 2010
Firstpage :
377
Lastpage :
385
Abstract :
Advances in biomedical technology and research have resulted in a large number of research findings, which are primarily published in unstructured text such as journal articles. Text mining techniques have thus been employed to extract knowledge from such data. In this article, we focus on the task of identifying and extracting relations between bio-entities such as green tea and breast cancer. Unlike previous work that employs heuristics such as co-occurrence patterns and handcrafted syntactic rules, we propose a verb-centric algorithm. This algorithm identifies and extracts the main verb(s) in a sentence and therefore does not require predefined rules or patterns. Using the main verb(s), it then extracts the two entities involved in a relationship. The biomedical entities are identified using a dependency parse tree by applying syntactic and linguistic features such as prepositional phrases and semantic role analysis. The proposed verb-centric approach can effectively handle complex sentence structures such as clauses and conjunctive sentences. We evaluate the algorithm on several data sets and achieve an average F-score of 0.905, which is significantly higher than that of previous work.
Keywords :
data mining; diseases; medical computing; text analysis; biomedical technology; biomedical text; breast cancer; conjunctive sentences; handcrafted syntactic rules; journal articles; relationship extraction; text mining techniques; unstructured text; verb centric approach; Abstracts; Chemicals; Colon; Compounds; Protein engineering; Proteins; Semantics; Biomedical Text Mining; Natural Language Processing (NLP); Relationship Extraction; Verb-centric Method;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on
Conference_Location :
Pittsburgh, PA
Print_ISBN :
978-1-4244-7912-2
Electronic_ISBN :
978-0-7695-4154-9
Type :
conf
DOI :
10.1109/ICSC.2010.14
Filename :
5629120