DocumentCode :
2536194
Title :
Discovering Informative Syntactic Relationships between Named Entities in Biomedical Literature
Author :
Appice, Annalisa ; Ceci, Michelangelo ; Loglisci, Corrado
Author_Institution :
Dipt. di Inf., Univ. degli Studi di Bari, Bari, Italy
fYear :
2010
fDate :
11-16 April 2010
Firstpage :
120
Lastpage :
125
Abstract :
The discovery of new and potentially meaningful relationships between named entities in biomedical literature can take great advantage from the application of multirelational data mining approaches in text mining. This is motivated by the peculiarity of multi-relational data mining to be able to express and manipulate relationships between entities. We investigate the application of such an approach to address the task of identifying informative syntactic structures, which are frequent in biomedical abstract corpora. Initially, named entities are annotated in text corpora according to some biomedical dictionary (e.g. MeSH taxonomy). Tagged entities are then integrated in syntactic structures with the role of subject and/or object of the corresponding verb. These structures are represented in a first-order language. Multi-relational approach to frequent pattern discovery allows to identify the verb-based relationships between the named entities which frequently occur in the corpora. Preliminary experiments with a collection of abstracts obtained by querying Medline on a specific disease are reported.
Keywords :
data mining; dictionaries; medical administrative data processing; medical computing; text analysis; MeSH taxonomy; Medline querying; biomedical abstract corpora; biomedical dictionary; biomedical literature; frequent pattern discovery; informative syntactic relationships; multirelational data mining; named entities; text corpora; text mining; verb-based relationships; Abstracts; Data mining; Databases; Dictionaries; Diseases; Proteins; Search methods; Speech; Taxonomy; Text mining; Biomedical Literature; Frequent Pattern Discovery; Multi-Relational Data Mining; Syntactic Text Structure;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Databases Knowledge and Data Applications (DBKDA), 2010 Second International Conference on
Conference_Location :
Menuires
Print_ISBN :
978-1-4244-6081-6
Type :
conf
DOI :
10.1109/DBKDA.2010.14
Filename :
5477135
Link To Document :
بازگشت