DocumentCode
3395518
Title
Towards Information Fusion in Pathway Evaluation: Encoding Relations in Biomedical Texts
Author
Dura, E. ; Gawronska, B. ; Olsson, B. ; Erlendsson, B.
Author_Institution
Lexwarel Labs, Univ. of Skovde
fYear
2006
fDate
10-13 July 2006
Firstpage
1
Lastpage
7
Abstract
The long-term goal of the research presented in this paper is to incorporate linguistic text analysis into a system for evaluation of biological pathways. In this system, relations extracted from biomedical texts will be compared with pathways encoded in existing specialized databases. In this way, the biologist´s conclusions regarding the plausibility and/or novelty of a certain relation between genes, proteins, etc., can be supported by fused information from biological databases and biological literature. We aim at overcoming the shortcomings of existing systems for information retrieval by proposing a method based on thorough linguistic analysis of a large text corpus. In this paper, we present a comparative analysis of two corpora: one consisting of biomedical texts from PubMed, the other one of general English prose. The results stress the importance of taking multiword entries into account when constructing a system for extracting biological relations from texts
Keywords
computational linguistics; encoding; information retrieval; knowledge acquisition; medical information systems; natural languages; text analysis; PubMed; biological databases; biomedical texts; encoding; information fusion; information retrieval; linguistic text analysis; pathway evaluation; Bioinformatics; Biological information theory; Biomedical informatics; Data mining; Databases; Encoding; Information analysis; Information retrieval; Proteins; Text analysis; bioinformatics; corpus analysis; information extraction; information retrieval; multiword expressions; relation extraction; text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Fusion, 2006 9th International Conference on
Conference_Location
Florence
Print_ISBN
1-4244-0953-5
Electronic_ISBN
0-9721844-6-5
Type
conf
DOI
10.1109/ICIF.2006.301666
Filename
4085952
Link To Document